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Description 

IDENTIFICATION OF NOVEL MS4A GENE FAMILY MEMBERS 



EXPRESSED BY HEMATOPOIETIC CELLS 



5 



Cross Reference to Related Applications 



This application is based on and claims priority to United States 
Provisional Application Serial Number 60/254,362, filed December 8, 2000, 
and United States Provisional Application Serial No. 60/270,057 filed 
February 20, 2001 , herein incorporated by reference in their entirety. 



This work was supported by NIH grants CA-81776 and CA-54464. 
Thus, the U.S. Government has rights in the invention. 



The present invention generally relates to a new class of MS4A 
proteins characterized by a membrane-embedded structure. More 
particularly, the present invention provides MS4A nucleic acid and 
polypeptide sequences, chimeric genes comprising disclosed MS4A 
20 sequences, antibodies that specifically recognize MS4A polypeptides, and 
uses thereof. 



10 



Grant Statement 



15 



Field of the Invention 



Table of Abbreviations 



ATCC 



American Tissue Culture Collection 



30 



25 



CD20 



MS4A family 



GFP 



FceRip 



htgs 



HTm4 



CD20 B lymphocyte differentiation antigen 
high-affinity IgE receptor p chain 
green fluorescent protein 
GenBank human genomic database 
hematopoietic CD20-like antigen 
membrane spanning 4-domain family, 
subfamily A 
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Background Art 

CD20, FceRIp, and HTm4 are three cell surface proteins expressed 
by hematopoietic cells that represent members of a nascent gene family 
• (Adra et al. (1999) Clin Genet 55:431-437, Kinet (1999) Annu Rev Immunol 
5 17:931-972; Tedder and Engel (1994) Immunol Today 15:450-454). The 
deduced amino acid sequence of human and mouse CD20 first 
demonstrated a cell surface protein containing four membrane-spanning 
regions, N- and C-terminal cytoplasmic domains, and an -50 amino acid 
loop that serves as the extracellular domain (Einfeld et al. (1988) EMBO J 

10 7:711-717; Stamenkovic and Seed (1988) J Exp Med 167:1975-1980; 
Tedder et al. (1988a) J Immunol 141:4388-4394; Tedder et al. (1988b) Proc 
Natl Acad Sci USA 85:208-212). Human CD20 shares 20% amino acid 
sequence identity with FcsRip and HTm4 (Adra et al. (1994) Proc Natl Acad 
Sci USA 91:10178-10182, Kuster et al. (1992) J Biol Chem 267:12782- 

15 12787). Moreover, these three proteins have a similar overall structure in 
man, mouse, and rat with significant sequence identity within the first three 
membrane-spanning domains (Kinet et al. (1988) Proc Natl Acad Sci USA 
85:6483-6487; Ra et al. (1989) Nature 19:1771-7; Tedder et al., 1988a). In 
addition, all three genes are located in the same region of human 

20 chromosome 11q12-13.1 (Adra et al., 1994; Hupp et al. (1989) J Immunol 
143:3787-3791; Tedder et al. (1989a) J Immunol 142:2555-2559) and 
mouse chromosome 19 (Hupp et al. 1989; Tedder et al., 1988a). These 
three genes are therefore likely to have evolved from a common precursor. 
Despite structural and sequence conservation between CD20, FceRip 

25 and HTm4, transcription of each gene is differentially regulated. CD20 is 
only expressed by B lymphocytes (Stashenko et al. (1980) J Immunol 
125:1678-1685; Tedder et al., 1988a). FceRIp is expressed by mast cells 
and basophils (Kinet, 1999). HTm4 is expressed by diverse lymphoid and 
myeloid origin hematopoietic cells (Adra et al., 1994). 

30 Although the function of HTm4 remains unexplored, CD20 and FceRIp 

have critical roles in cell signaling. CD20 forms a homo- or hetero-tetrameric 
complex that is functionally important for regulating cell cycle progression 
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and signal transduction in B lymphocytes (Tedder and Engel, 1994). CD20 
additionally regulates transmembrane Ca ++ conductance, possibly as a 
functional component of a Ca ++ -permeable cation channel (Bubien et al. J 
Cell Biol 121:1121-1132; Kanzaki et al. (1997a) J Biol Chem 272:14733- 
5 14739; Kanzaki et al. (1997b) J Biol Chem 272:4964-4969; Kanzaki et al. 

(1995) J Biol Chem 270:13099-13104). FceRIp is part of a tetrameric 
receptor complex consisting of a, p, and two y chains (Blank et al. (1989) 
Nature 337:187-189). FceRIp mediates interactions with IgE-bound antigens 
that lead to cellular responses such as the degranulation of mast cells. 

10 Specifically, the FcsRip subunit functions as an amplifier of FcsRip-mediated 
activation signals (Dombrowicz et al. (1998) Immunity 8:517-529; Lin et al. 

(1996) Cell 85:985-995). Because of their unique structure and sequence 
homology, CD20, FceRIp, and HTm4 are likely to share overlapping 
functional properties. 

15 CD20 and FcsRip are also important clinically. Antibodies against 

CD20 are effective in treating non-Hodgkin's lymphoma (McLaughlin et al. 
(1998) Oncology 12:1763-1769; Onrust et al. (1989) J Biol Chem 
264:15323-15327; Weiner (1999) Semin Oncol 26:43-51). Genetic 
variations at chromosome 1 1q12-13 can also play a role in the pathogenesis 

20 of allergic diseases (Adra et al., 1999; Kinet, 1999). Recent studies suggest 
that FcsRip contributes to such diseases, and other genetic elements in this 
region likely also contribute to allergic disease. 

Since CD20, FceRIp, and HTm4 are likely to have evolved by 
duplication of an ancestral gene, other related proteins might exist that form 

25 additional receptor complexes. In view of the clinical importance noted 
above, the identification of such proteins thus represents a long-felt and 
ongoing need in the art. To address this need, applicants have identified 
novel human and mouse proteins that span the cell membrane at least four 
times and share high levels of amino acid sequence identity with CD20, 

30 FceRIp, and HTm4. This finding reveals a new gene family that has been 
designated herein as the MS4A family (membrane spanning 4-domain 
family, subfamily A). Currently this family contains at least 10 subgroups 
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(MS4A1 through MS4A12) that encode at least 21 previously unidentified 
human and mouse proteins expressed by hematopoietic cells and by diverse 
cell types in non-hematopoietic tissues. 

5 Summary of the Invention 

The present invention discloses isolated MS4A polypeptides and 
isolated nucleic acid molecules encoding the same. Preferably, an isolated 
MS4A polypeptide, or functional portion thereof, comprises a polypeptide 
encoded by the nucleic acid molecule of any one of the odd numbered SEQ 

10 ID NOs:1-37 a polypeptide encoded by a nucleic acid molecule that is 
substantially identical to any one of the odd-numbered SEQ ID NOs:1-37, a 
polypeptide fragment encoded by a 20 nucleotide sequence that is identical 
to a contiguous 20 nucleotide sequence of any one of the odd-numbered 
. SEQ ID NOs:1-37, a polypeptide having an amino acid sequence of any one 

15 of the even-numbered SEQ ID NOs:2-38, a polypeptide that is a biological 
equivalent of any one of the even-numbered SEQ ID NOs:2-38, or a 
polypeptide that is immunologically cross-reactive with an antibody that 
shows specific binding with a polypeptide comprising some or all amino 
acids of any one of the even-numbered SEQ ID NOs:2-38. 

20 The present invention further teaches chimeric genes having a 

heterologous promoter that drives expression of a nucleic acid sequence 
encoding a MS4A polypeptide. Preferably, the chimeric gene is carried in a 
vector and introduced into a host cell so that a MS4A polypeptide of the 
present invention is produced. Preferred host cells include but are not 

25 limited to a bacterial cell, a hamster cell, a mouse cell, or a human cell. 

In another aspect of the invention, a method is provided for detecting 
a nucleic acid molecule that encodes a MS4A polypeptide. According to the 
method, a biological sample having nucleic acid material is hybridized under 
stringent hybridization conditions to a MS4A nucleic acid molecule of the 

30 present invention. Such hybridization enables a nucleic acid molecule of the 
biological sample and the MS4A nucleic acid molecule to form a detectable 
duplex structure. Preferably, the MS4A nucleic acid molecule includes some 
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or all nucleotides of any one of the odd-numbered SEQ ID NOs:1-37. Also 
preferably, the biological sample comprises human nucleic acid material. 

The present invention further teaches an antibody that specifically 
recognizes a MS4A polypeptide. Preferably, the antibody recognizes some 
5 or all amino acids of any one of the even-numbered SEQ ID NOs:2-38. A 
method for producing a MS4A antibody is also disclosed, and the method 
comprises recombinantly or synthetically producing a MS4A polypeptide, or 
portion thereof; formulating the MS4A polypeptide so that it is an effective 
immunogen; immunizing an animal with the formulated polypeptide to 

10 generate an immune response that includes production of MS4A antibodies; 
and collecting blood serum from the immunized animal containing antibodies 
that specifically recognize a MS4A polypeptide. Antibody-producing cells 
can be optionally fused with an immortal cell line whereby a monoclonal 
antibody that specifically recognizes a MS4A polypeptide can be selected. 

15 Preferably, the MS4A polypeptide used as an immunogen includes some or 
all amino acid sequences of any one the even-numbered SEQ ID NOs:2-38. 

A method is also provided for detecting a level of MS4A polypeptide 
using an antibody that specifically recognizes a MS4A polypeptide. 
According to the method, a biological sample is obtained from an 

20 experimental subject and a control subject, and a MS4A polypeptide is 
detected in the sample by immunochemical reaction with the MS4A 
antibody. Preferably, the antibody recognizes amino acids of any one of the 
even-numbered SEQ ID NOs:2-38, and is prepared according to a method 
of the present invention for producing such an antjbody. 

25 The present invention further discloses a method for identifying a 

compound that modulates MS4A function. The method comprises: exposing 
an isolated MS4A polypeptide to one or more compounds, and assaying 
binding of a compound to the isolated MS4A polypeptide. A compound is 
selected that demonstrates specific binding to the isolated MS4A 

30 polypeptide. Preferably, the MS4A polypeptide used in the binding assay of 
the method includes some or all amino acids of any one of the even- 
numbered SEQ ID NOs:2-38. 
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Also provided is a method for identifying a regulator of MS4A gene 
expression. The method comprises (a) exposing a cell sample with a 
candidate compound to be tested, the cell sample containing at least one 
cell containing a DNA construct comprising a modulatable transcriptional 
5 regulatory sequence of a MS4A-encoding nucleic acid and a reporter gene 
which is capable of producing a detectable signal; (b) evaluating an amount 
of signal produced in relation to a control sample; and (c) identifying a 
candidate compound as a modulator of MS4A gene expression based on the 
amount of signal produced in relation to a control sample. Preferably, the 

10 modulatable transcriptional regulatory sequence of a MS4A-encoding 
nucleic acid comprises a sequence that is immediately upstream of the initial 
coding region of a MS4A gene as set forth in any one of SEQ ID NOs:73-81 . 

The present invention further provides a method for modulating MS4A 
function in a subject. According to the method, a pharmaceutical 

15 composition is prepared that includes a substance capable of modulating 
MS4A expression or function, and a carrier. An effective dose of the 
pharmaceutical composition is administered to a subject, whereby MS4A 
activity is altered in the subject. Provided are therapeutic methods wherein 
a change in MS4A activity comprises a shift in the abundance of cell 

20 subpopulations expressing said protein, modulation of [Ca 2+ ]j levels, or 
altered cell function. In a preferred embodiment, the substance used to 
perform this method shows specific binding to some or all amino acids of 
any one of the even-numbered SEQ ID NOs:2-38, and was discovered by a 
method of the present invention. In another embodiment, MS4A function is 

25 disrupted by immunizing a subject with an effective dose of the disclosed 
MS4A polypeptide. The immune system of the subject produces an 
antibody that specifically recognizes the MS4A polypeptide, and preferably 
recognizes some or all of amino acids of any one of the even-numbered 
SEQ ID NOs:2-38. In a further embodiment, a gene therapy vector is used, 

30 the vector comprising a nucleotide sequence encoding a MS4A polypeptide. 
Alternatively, the gene therapy vector comprises a nucleotide sequence 
encoding a nucleic acid molecule, a peptide, or a protein that interacts with a 
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MS4A nucleic acid or polypeptide. Preferably, the subject is a human 
subject. 

Accordingly, it is an object of the present invention to provide novel 
MS4A nucleic acid and polypeptide sequences, and novel methods relating 
5 thereto. This object is achieved in whole or in part by the present invention. 

An object of the invention having been stated above, other objects 
and advantages of the present invention will become apparent to those 
skilled in the art after a study of the following description of the invention, 
Figures and non-limiting Examples. 

10 

Brief Description of the Drawings 
Figure 1 depicts cDNAs encoded by fifteen new human or mouse 
MS4A gene products. Consensus sequences from cDNAs and overlapping 
ESTs are indicated by their GenBank Accession numbers. Representative 

1 5 full-length cDNAs for each gene product are shown, except for MS4a3 which 
was not full-length. 5' and 3' untranslated sequences are shown as 
horizontal lines with relative nucleotide lengths shown. Coding regions are 
shown as boxes with translation initiation and termination codons and their 
relative nucleotide locations shown. Poly(A) attachment signal sequences 

20 (AATAAA) are indicated when known. Deduced hydrophobic regions are 
shown as filled boxes with the predicted membrane-spanning domains 
shown as TM1-TM4. Additional hydrophobic regions in MS4A4 proteins are 
shown as shaded boxes. Sites of putative nucleotide polymorphisms in 
MS4A6A are indicated by two (X)s. 

25 Figure 2 depicts exon-intron organization of the human MS4A genes. 

The maps were constructed by aligning known and predicted MS4A cDNA 
sequences with human genomic sequences as described in Materials and 
Methods. Exons are shown as boxes with the predicted translation initiation 
codons (ATG), transmembrane domains (TM) and termination codons 

30 indicated on the top. All exon and intron distances are shown to scale. 
Gaps indicate where intron distances have not been determined for MS4A3, 
MS4A4A, and MS4A12. Two long introns present in MS4A4E are not to 
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scale but the intron lengths are indicated. Exon numbering for MS4A1, and 
MS4A2 is as published (Kuster et al., 1992; Tedder et al., 1988a; Tedder et 
al., 1988b). 

Figure 3 shows human MS4A4E protein and transcript sequences 
5 predicted from genomic DNA sequences. MS4A4E sequences are 
compared with human MS4A4A cDNA (disclosed herein) and genomic 
sequences. Gaps were introduced to provide optimal alignment. The boxed 
AAC sequence near the 5' end of the MS4A4A sequence indicates the 
length of the most 5' MS4A4A cDNA sequence. Sequences upstream of this 

10 are based on contiguous genomic DNA sequences. Nucleotide numbering 
is based on the MS4A4A cDNA sequence, disclosed herein. Predicted 
translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates predicted translation 
termination codons. Potential poly-A attachment signal sequences 

1 5 (AATAAA) are boxed . 

Figure 4 shows human MS4A6E protein and transcript sequences 
predicted from genomic DNA and overlapping cDNA sequences. 
Predicted/WS4>46£: transcript sequences are compared with human MS4A6A 
cDNA sequence (disclosed herein). Gaps were introduced in the nucleotide 

20 sequence to provide optimal alignment. The 5' end of both transcripts start 
at 3' splice-acceptor sites which demark the first translated exons for both 
genes. The 5' end of the putative MS4A6E transcript is based on genomic 
DNA sequence, while the predicted sequences starting at nucleotide 60 
were based on both genomic DNA sequences and overlapping cDNA 

25 sequences. A gap in the MS4A6A sequence is indicated where TM1/2 and 
TM2 exons are not found in MS4A6E transcripts. MS4A6A nucleotide 
numbering is based on the cDNA sequence (disclosed herein). Predicted 
translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates the predicted translation 

30 termination codon for the MS4A6E protein. 

Figure 5 shows human MS4A10 protein and transcript sequences 
predicted from human genomic DNA sequences. MS4A10 nucleotide 
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sequence is compared with mouse MS4a10 cDNA sequence (disclosed 
herein). The 5' end of both transcripts start at 3' splice-acceptor sites which 
demark the first translated exons for both genes. MS4a10 nucleotide 
numbering is based on the cDNA sequence (disclosed herein). Predicted 
5 . translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates predicted translation 
termination codon for the MS4A10 protein. Potential poly-A attachment 
signal sequences (AATAAA) are boxed. 

Figure 6 depicts a physical linkage map for the MS4A genes. A 

10 scheme for chromosome 11 structure is shown on the left with the mapped 
locations for MS4A1, MS4A2 and MS4A3 indicated. Representative human 
BAC clones are shown as vertical black bars with clone names shown on the 
top and clone size shown at the bottom. All distances are shown to the 
indicated scale. The distance between and spatial relationship of RP11- 

15 312N17 to the four other overlapping BACs shown at the bottom are 
unknown. Thin bars indicate continuous characterized (mapped or 
sequenced) regions of DNA that contain identified MS4A genes. When the 
relative position of this region of DNA is known relative to the representative 
BACs that are shown, the thin bars overlay the BAC. The mapped position 

20 of each MS4A gene is indicated on the right with the relative direction of 
gene translation indicated by arrows ( — ►). In some cases, approximate 
distances between MS4A genes (termination codons to the translation 
initiation codon for the next gene) are indicated in base pairs (bp). In some 
cases, approximate MS4A gene size is indicated showing the distance 

25 between predicted translation initiation codons and translation termination 
codons as show in Figure 7. 

Figure 7 depicts deduced amino acid sequences for CD20 (human 
A1, SEQ ID NO:40; mouse a1, SEQ ID NO:48), FceRIp (human A2, SEQ ID 
NO:42; mouse a2, SEQ ID NO:50), HTm4 (human A3, SEQ ID NO:44; 

30 mouse a3, SEQ ID NO:20), and 19 new MS4A (human) (even-numbered 
SEQ ID NOs:2-18, 46) and MS4a (mouse and pig) proteins (even-numbered 
SEQ ID NOs:22-38, 56). Gaps were introduced to optimize alignments. 
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Numbers represent predicted residue positions. The predicted membrane- 
spanning regions (TM1-TM4) are indicated. Predicted intron|exon splice 
junctions are indicated by vertical bars where information was available. 
Amino acids common to 10 or more proteins are shaded, indicates partial 
5 sequence for the MS4a3 protein. CD20, FceRip, and HTm4 sequences and 
known intron|exon borders (SEQ ID NOs:39-44, 47-50) are as published 
(Adra et al., 1994; Kuster et al., 1992; Ra et al. f 1989; Tedder et al., 1988a; 
Tedder et al., 1989b; Tedder et al., 1988b). MS4A12 represents a 
conceptual translation (SEQ ID NO:46) of a human colon mucosa cDNA 

10 sequence (GenBank AK000224, SEQ ID NO:45), and MS4a12 represents a 
conceptual translation (SEQ ID NO:56) of a homologous cDNA sequence 
from pig (GenBank AJ236932, SEQ ID NO:55). 

Figure 8 depicts UPGMA (unweighted pair group method using 
arithmetic averages) tree of deduced MS4A and MS4a protein sequences. 

15 Horizontal tree branch length is a measure of sequence relatedness. For 
example, MS4a4B and MS4a4C are the most similar in sequence, while 
CD20 (MS4A1) sequences were the most divergent from other family 
members. The MS4a12p sequence was from pig, while all other MS4a 
sequences were from mouse. The UPGMA tree was generated using 

20 Geneworks version 2.0 (IntelliGenetics, Inc., Mountain View, California, 
USA). 

Figure 9 shows immunofluorescent detection of CD20 expression 
during B cell development. Single cell suspensions of leukocytes were 
isolated from wild-type mice, stained using MB20-13 (visualized using a PE- 

25 conjugated, anti-mouse lgG3 antibody) and anti-B220 (FITC-conjugated) 
monoclonal antibodies, and examined by two-color immunofluorescence 
staining with flow cytometry analysis. Quadrant gates indicate negative and 
positive populations of cells as determined using isotype-matched control 
monoclonal antibodies. The gated cell populations correspond to the cells 

30 described in Table 7, and are shown for reference. These results are 
representative of those obtained with six (6) two month-old wild type mice. 
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Figure 10 summarizes the strategy for targeted disruption of the 
mouse CD20 gene. 

Figure 10A shows genomic clones encoding CD20. 

Figure 10B shows the intron-exon organization of the wild typeCD20 
5 allele containing exons 5-8 (shaded squares). 

Figure 10C shows the structure of the CD20 targeting vector. 

Figure 10D shows the predicted structure of the CD20 allele after 
gene targeting in ES cells by homologous recombination. The EcoR V 
restriction site in exon 6 is deleted as indicated. 
10 Figure 10E presents Southern blot analysis of tail DNA from two wild 

type and four CD20' A mice. Genomic DNA was digested with EcoR V, 
transferred to nitrocellulose and hybridized with the 5' probe indicated in (D). 

Figure 10F shows PCR amplification of genomic DNA from wild type 
and CD20" A mice using primers that bind in exons 6 and 7. Amplification of 
15 glyceraldehyde-3-phosphate dehydrogenase (G3PDH) is shown as a 
positive control. 

Figure 10G shows PCR amplification of cDNA generated from splenic 
RNA of wild type and CD20" /_ mice. Each reaction mixture contained a 
sense primer that hybridized with sequences encoded by exon 3 and 
20 antisense primers that hybridized with either exon 6 or Neo r gene promoter 
sequences. 

Figures 10H and 101 show reactivity of the MB20-13 monoclonal 
antibody with CD20 cDNA-transfected (thick line) or untransfected (dashed 
line) 300.19 cells (Figure 10H) or Chinese Hamster Ovary (CHO) cells 
25 (Figure 101). The thin lines represent CD20 cDNA-transfected cells stained 
with secondary antibody alone or an isotype-control monoclonal antibody. 
Indirect immunofluorescence staining was visualized by flow cytometry 
analysis. 

Figure 10J shows immunofluorescent staining of splenocytes from 
30 CD20" /_ or wild type mice with MB20-13 (visualized using a PE-conjugated, 
anti-mouse lgG3 antibody) and anti-B220 (FITC-conjugated) monoclonal 
antibodies. Splenocytes from CD20' A mice generated histograms identical to 
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those obtained without MB20-1 monoclonal antibody present, using the 
secondary antibody alone. 

Figure 11 depicts immunofluorescent detection of B lymphocyte 
subpopulations in CD20" A and wild type mice. Lymphocytes were isolated 
5 and examined by two color immunofluorescent staining with flow cytometry 
analysis. Quadrants delineated by squares indicate negative and positive 
populations of cells as determined using unreactive monoclonal antibody 
controls. The gated cell populations correspond to the cells described in 
Table 7 that represent at least 6 mice of each genotype. 

10 Figure 12 shows altered signal transduction in CD20~ A B cells. Figure 

12 also shows CD19 expression by splenocytes from CD20" /_ (thin line) and 
wild type (thick line) mice. Immunofluorescence staining using PE- 
conjugated anti-CD19 monoclonal antibody with flow cytometry analysis. 
The dashed line represents staining of wild type splenocytes with a control 

15 antibody. 

Figure 12A presents calcium responses induced by BCR or CD19 
ligation in CD20~ / " and wild type B cells. Splenocytes were loaded with 1 pM 
indo-1-AM ester and B cells were stained with FITC-conjugated anti-B220 
antibody. At 1 min (arrow), optimal concentrations of goat anti-IgM F(ab') 2 

20 antibody fragments, anti-CD19 monoclonal antibody or Thapsigargin were 
added, with or without EGTA present. Increased ratios of indo-1 
fluorescence indicate increased [Ca 2+ ]j. Results represent those from at 
least four experiments. 

Figure 12B presents assays of tyrosine phosphorylation of proteins 

25 from purified splenic B cells of CD20" /_ and wild type mice. B cells (2 x 
10 7 /sample) were incubated with anti-IgM antibody for the times shown and 
detergent lysed. Proteins were resolved by SDS-PAGE, transferred to 
nitrocellulose and immunoblotted with anti-phosphotyrosine (anti-PTyr) 
antibody. The blot was stripped and reprobed with anti-SHP-1 antibody as a 

30 control for equivalent protein loading. Western blots from two of three 
experiments are shown to demonstrate the range of results. 
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Detailed Description of the Invention 
The present invention provides isolated nucleic acids encoding MS4A 
polypeptides (representative embodiments set forth as the odd-numbered 
SEQ ID NOs:1-37), isolated MS4A polypeptides (representative 
5 embodiments set forth as the even-numbered SEQ ID NOs:2-38), and uses 
thereof. The disclosed MS4A nucleic acids and polypeptides can be used 
according to methods of the present invention for drug discovery screens, for 
therapeutic treatment of atopic conditions, and for therapeutic regulation of 
[Ca 2+ ]i levels, among other uses. 

10 

JL DEFINITIONS 

While the following terms are believed to be well understood by one 
of ordinary skill in the art, the following definitions are set forth to facilitate 
explanation of the invention. The entire contents of all publications 
15 mentioned herein, including the discussion of the background art presented 
above, are hereby fully incorporated by reference. 
I .A. MS4A nucleic acids 

The nucleic acid molecules provided by the present invention include 
the isolated nucleic acid molecules of any one of the odd-numbered SEQ ID 

20 NOs:1-37, sequences substantially similar to sequences of any one of the 
odd-numbered SEQ ID NOs:1-37, conservative variants thereof, 
subsequences and elongated sequences thereof, complementary DNA 
molecules, and corresponding RNA molecules. The present invention also 
encompasses genes, cDNAs, chimeric genes, and vectors comprising 

25 disclosed MS4A nucleic acid sequences. 

The term "nucleic acid molecule" refers to deoxyribonucleotides or 
ribonucleotides and polymers thereof in either single- or double-stranded 
form. Unless specifically limited, the term encompasses nucleic acids 
containing known analogues of natural nucleotides which have similar 

30 properties as the reference natural nucleic acid. Unless otherwise indicated, 
a particular nucleotide sequence also implicitly encompasses conservatively 
modified variants thereof (e.g. degenerate codon substitutions), 
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complementary sequences, subsequences, elongated sequences, as well as 
the sequence explicitly indicated. The terms "nucleic acid molecule" or 
"nucleotide sequence" can also be used in place of "gene", "cDNA", or 
"mRNA". Nucleic acids can be derived from any source, including any 
5 organism. 

The term "isolated", as used in the context of a nucleic acid molecule, 
indicates that the nucleic acid molecule exists apart from its native 
environment and is not a product of nature. An isolated DNA molecule can 
exist in a purified form or can exist in a non-native environment such as a 

10 transgenic host cell. 

The term "purified", when applied. to a nucleic acid, denotes that the 
nucleic acid is essentially free of other cellular components with which it is 
associated in the natural state. Preferably, a purified nucleic acid molecule 
is a homogeneous dry or aqueous solution. The term "purified" denotes that 

15 a nucleic acid or protein gives rise to essentially one band in an 
electrophoretic gel. Particularly, it means that the nucleic acid is at least 
about 50% pure, more preferably at least about 85% pure, and most 
preferably at least about 99% pure. 

The term "substantially identical", the context of two nucleotide or 

20 amino acid sequences, can also be defined as two or more sequences or 
subsequences that have at least 60%, preferably 80%, more preferably 90- 
95%, and most preferably at least 99% nucleotide or amino acid sequence 
identity, when compared and aligned for maximum correspondence, as 
measured using one of the following sequence comparison algorithms 

25 (described herein below under the heading Nucleotide and Amino Acid 
Sequence Comparisons ) or by visual inspection. Preferably, the substantial 
identity exists in nucleotide sequences of at least 50 residues, more 
preferably in nucleotide sequence of at least about 100 residues, more 
preferably in nucleotide sequences of at least about 150 residues, and most 

30 preferably in nucleotide sequences comprising complete coding sequences. 
In one aspect, polymorphic sequences can be substantially identical 
sequences. The term "polymorphic" refers to the occurrence of two or more 
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genetically determined alternative sequences or alleles in a population. An 
allelic difference can be as small as one base pair. 

Another indication that two nucleotide sequences are substantially 
identical is that the two molecules specifically or substantially hybridize to 
5 each other under stringent conditions. In the context of nucleic acid 
hybridization, two nucleic acid sequences being compared can be 
designated a "probe" and a "target". A "probe" is a reference nucleic acid 
molecule, and a "'target" is a test nucleic acid molecule, often found within a 
heterogenous population of nucleic acid molecules. A "target sequence" is 

1 0 synonymous with a "test sequence". 

A preferred nucleotide sequence employed for hybridization studies 
or assays includes probe sequences that are complementary to or mimic at 
least an about 14 to 40 nucleotide sequence of a nucleic acid molecule of 
the present invention. Preferably, probes comprise 14 to 20 nucleotides, or 

15 even longer where desired, such as 30, 40, 50, 60, 100, 200, 300, or 500 
nucleotides or up to the full length of any of those set forth as the odd- 
numbered SEQ ID NOs:1-37. Such fragments can be readily prepared by, 
for example, directly synthesizing the fragment by chemical synthesis, by 
application of nucleic acid amplification technology, or by introducing 

20 selected sequences into recombinant vectors for recombinant production. 

The phrase "hybridizing specifically to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide 
sequence under stringent conditions when that sequence is present in a 
complex nucleic acid mixture (e.g., total cellular DNA or RNA). The phrase 

25 "binds substantially to" refers to complementary hybridization between a 
probe nucleic acid molecule and a target nucleic acid molecule and 
embraces minor mismatches that can be accommodated by reducing the 
stringency of the hybridization media to achieve the desired hybridization. 

"Stringent hybridization conditions" and "stringent hybridization wash 

30 conditions" in the context of nucleic acid hybridization experiments such as 
Southern and Northern blot analysis are both sequence- and environment- 
dependent. Longer sequences hybridize specifically at higher temperatures. 
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An extensive guide to the hybridization of nucleic acids is found in Tijssen 
(1993) Laboratory Techniques in Biochemistry and Molecular Biology- 
Hybridization with Nucleic Acid Probes , part I chapter 2, Elsevier, New York, 
New York. Generally, highly stringent hybridization and wash conditions are 
5 selected to be about 5°C lower than the thermal melting point (T m ) for the 
specific sequence at a defined ionic strength and pH. Typically, under 
"stringent conditions" a probe will hybridize specifically to its target 
subsequence, but to no other sequences. 

The T m is the temperature (under defined ionic strength and pH) at 

10 which 50% of the target sequence hybridizes to a perfectly matched probe. 
Very stringent conditions are selected to be equal to the T m for a particular 
probe. An example of stringent hybridization conditions for Southern or 
Northern Blot analysis of complementary nucleic acids having more than 
about 100 complementary residues is overnight hybridization in 50% 

15 formamide with 1 mg of heparin at 42°C. An example of highly stringent 
wash conditions is 15 minutes in 0.1 5 M NaCI at 65°C. An example of 
stringent wash conditions is 15 minutes in 0.2X SSC buffer at 65°C ( See 
Sambrook et al. eds. (1989) Molecular Cloning: A Laboratory Manual. Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York for a 

20 description of SSC buffer). Often, a high stringency wash is preceded by a 
low stringency wash to remove background probe signal. An example of 
medium stringency wash conditions for a duplex of more than about 100 
nucleotides, is 15 minutes in 1X SSC at 45°C. An example of low stringency 
wash for a duplex of more than about 100 nucleotides, is 15 minutes in 4-6X 

25 SSC at 40°C. For short probes (e.g., about 10 to 50 nucleotides), stringent 
conditions typically involve salt concentrations of less than about 1.0 M Na + 
ion, typically about 0.01 to 1.0 M Na + ion concentration (or other salts) at pH 
7.0-8.3, and the temperature is typically at least about 30°C. Stringent 
conditions can also be achieved with the addition of destabilizing agents 

30 such as formamide. In general, a signal to noise ratio of 2-fold (or higher) 
than that observed for an unrelated probe in the particular hybridization 
assay indicates detection of a specific hybridization. 
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The following are examples of hybridization and wash conditions that 
can be used to clone homologous nucleotide sequences that are 
substantially identical to reference nucleotide sequences of the present 
invention: a probe nucleotide sequence preferably hybridizes to a target 
5 nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4l 1 
mM EDTA at 50°C followed by washing in 2X SSC, 0.1 % SDS at 50°C; more 
preferably, a probe and target sequence hybridize in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C followed by washing in 1X 
SSC, 0.1% SDS at 50°C; more preferably, a probe and target sequence 

10 hybridize in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4l 1 mM EDTA at 
50°C followed by washing in 0.5X SSC, 0.1% SDS at 50°C; more preferably, 
a probe and target sequence hybridize in 7% sodium dodecyl sulfate (SDS), 
0.5 M NaP0 4 , 1 mM EDTA at 50°C followed by washing in 0.1X SSC, 0.1% 
SDS at 50°C; more preferably, a probe and target sequence hybridize in 7% 

15 sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C followed 
by washing in 0.1X SSC, 0.1% SDS at 65°C. 

A further indication that two nucleic acid sequences are substantially 
identical is that proteins encoded by the nucleic acids are substantially 
identical, share an overall three-dimensional structure, are biologically 

20 functional equivalents, or are immunologically cross-reactive. These terms 
are defined further under the heading MS4A Polypeptides herein below. 
Nucleic acid molecules that do not hybridize to each other under stringent 
conditions are still substantially identical if the corresponding proteins are 
substantially identical. This can occur, for example, when two nucleotide 

25 sequences are significantly degenerate as permitted by the genetic code. 

The term "conservatively substituted variants" refers to nucleic acid 
sequences having degenerate codon substitutions wherein the third position 
of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues (Batzer et al. (1991) Nucleic Acids Res 19:5081; 

30 Ohtsuka et al. (1985) J Biol Chem 260:2605-2608; Rossolini et al. (1994) 
Mol Cell Probes 8:91-98). 
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The term "subsequence" refers to a sequence of nucleic acids that 
comprises a part of a longer nucleic acid sequence. An exemplary 
subsequence is a probe, described herein above, or a primer. The term 
"primer" as used herein refers to a contiguous sequence comprising about 8 
5 or more deoxyribonucleotides or ribonucleotides, preferably 1 0-20 
nucleotides, and more preferably 20-30 nucleotides of a selected nucleic 
acid molecule. The primers of the invention encompass oligonucleotides of 
sufficient length and appropriate sequence so as to provide initiation of 
polymerization on a nucleic acid molecule of the present invention. 

10 The term "elongated sequence" refers to an addition of nucleotides 

(or other analogous molecules) incorporated into the nucleic acid. For 
example, a polymerase (e.g., a DNA polymerase), e.g., a polymerase which 
adds sequences at the 3' terminus of the nucleic acid molecule. In addition, 
the nucleotide sequence can be combined with other DNA sequences, such 

15 as promoters, promoter regions, enhancers, polyadenylation signals, intronic 
sequences, additional restriction enzyme sites, multiple cloning sites, and 
other coding segments. 

The term "complementary sequence", as used herein, indicates two 
nucleotide sequences that comprise antiparallel nucleotide sequences 

20 capable of pairing with one another upon formation of hydrogen bonds 
between base pairs. As used herein, the term "complementary sequences" 
means nucleotide sequences which are substantially complementary, as can 
be assessed by the same nucleotide comparison set forth above, or is 
defined as being capable of hybridizing to the nucleic acid segment in 

25 question under relatively stringent conditions such as those described 
herein. A particular example of a complementary nucleic acid segment is an 
antisense oligonucleotide. 

The term "gene" refers broadly to any segment of DNA associated 
with a biological function. A gene encompasses sequences including but not 

30 limited to a coding sequence, a promoter region, a cis-regulatory sequence, 
a non-expressed DNA segment is a specific recognition sequence for 
regulatory proteins, a non-expressed DNA segment that contributes to gene 
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expression, a DNA segment designed to have desired parameters, or 
combinations thereof. A gene can be obtained by a variety of methods, 
including cloning from a biological sample, synthesis based on known or 
predicted sequence information, and recombinant derivation of an existing 
5 sequence. 

The term "gene expression" generally refers to the cellular processes 
by which a biologically active polypeptide is produced from a DNA 
sequence. 

The present invention also encompasses chimeric genes comprising 

10 the disclosed MS4A sequences. The term "chimeric gene", as used herein, 
refers to a promoter region operably linked to a MS4A coding sequence, a 
nucleotide sequence producing an antisense RNA molecule, a RNA 
molecule having tertiary structure, such as a hairpin structure, or a double- 
stranded RNA molecule. 

15 The term "operably linked", as used herein, refers to a promoter 

region that is connected to a nucleotide sequence in such a way that the 
transcription of that nucleotide sequence is controlled and regulated by that 
promoter region. Techniques for operatively linking a promoter region to a 
nucleotide sequence are well known in the art. 

20 The terms "heterologous gene", "heterologous DNA sequence", 

"heterologous nucleotide sequence", "exogenous nucleic acid molecule", or 
"exogenous DNA segment", as used herein, each refer to a sequence that 
originates from a source foreign to an intended host cell or, if from the same 
source, is modified from its original form. Thus, a heterologous gene in a 

25 host cell includes a gene that is endogenous to the particular host cell but 
has been modified, for example by mutagenesis or by isolation from native 
cis-regulatory sequences. The terms also include non-naturally occurring 
multiple copies of a naturally occurring nucleotide sequence. Thus, the 
terms refer to a DNA segment that is foreign or heterologous to the cell, or 

30 homologous to the cell but in a position within the host cell nucleic acid 
wherein the element is not ordinarily found. 

The term "promoter region" defines a nucleotide sequence within a 
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gene that is positioned 5' to a coding sequence of a same gene and 
functions to direct transcription of the coding sequence. The promoter 
region includes a transcriptional start site and at least one cis-regulatory 
element. The present invention encompasses nucleic acid sequences that 
5 comprise a promoter region of a MS4A gene, or functional portion thereof. 

The term "cis-acting regulatory sequence" or "cis-regulatory motif or 
"response element", as used herein, each refer to a nucleotide sequence 
that enables responsiveness to a regulatory transcription factor. 
Responsiveness can encompass a decrease or an increase in 

10 transcriptional output and is mediated by binding of the transcription factor to 
the DNA molecule comprising the response element. 

The term "transcription factor" generally refers to a protein that 
modulates gene expression by interaction with the cis-regulatory element 
and cellular components for transcription, including RNA Polymerase, 

15 • Transcription Associated Factors (TAFs), chromatin-remodeling proteins, 
and any other relevant protein that impacts gene transcription. 

A "functional portion" of a promoter gene fragment is a nucleotide 
sequence within a promoter region that is required for normal gene 
transcription. To determine nucleotide sequences that are functional, the 

20 expression of a reporter gene is assayed when variably placed under the 
direction of a promoter region fragment. 

Promoter region fragments can be conveniently made by enzymatic 
digestion of a larger fragment using restriction endonucleases or DNAse I. 
Preferably, a functional promoter region fragment comprises about 5000 

25 nucleotides, more preferably 2000 nucleotides, more preferably about 1000 
nucleotides. Even more preferably a functional promoter region fragment 
comprises about 500 nucleotides, even more preferably a functional 
promoter region fragment comprises about 100 nucleotides, and even more 
preferably a functional promoter region fragment comprises about 20 

30 nucleotides. 

The terms "reporter gene" or "marker gene" or "selectable marker" 
each refer to a heterologous gene encoding a product that is readily 
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observed and/or quantitated. A reporter gene is heterologous in that it 
originates from a source foreign to an intended host cell or, if from the same 
source, is modified from its original form. Non-limiting examples of 
detectable reporter genes that can be operably linked to a transcriptional 
5 regulatory region can be found in Alam & Cook (1990) Anal Biochem 
188:245-254 and PCT International Publication No. WO 97/47763. 
Preferred reporter genes for transcriptional analyses include the lacZ gene 
( See , e.g., Rose & Botstein (1983) Meth Enzymol 101:167-180), Green 
Fluorescent Protein (GFP) (Cubitt et al. (1995) Trends Biochem Sci 20:448- 

10 455), luciferase, or chloramphenicol acetyl transferase (CAT). Preferred 
reporter genes for methods to produce transgenic animals include but are 
not limited to antibiotic resistance genes, and more preferably the antibiotic 
resistance gene confers neomycin resistance. Any suitable reporter and 
detection method can be used, and it will be appreciated by one of skill in 

15 the art that no particular choice is essential to or a limitation of the present 
invention. 

An amount of reporter gene can be assayed by any method for 
qualitatively or preferably, quantitatively determining presence or activity of 
the reporter gene product. The amount of reporter gene expression directed 

20 by each test promoter region fragment is compared to an amount of reporter 
gene expression to a control construct comprising the reporter gene in the 
absence of a promoter region fragment. A promoter region fragment is 
identified as having promoter activity when there is significant increase in an 
amount of reporter gene expression in a test construct as compared to a 

25 control construct. The term "significant increase", as used herein, refers to 
an quantified change in a measurable quality that is larger than the margin of 
error inherent in the measurement technique, preferably an increase by 
about 2-fold or greater relative to a control measurement, more preferably an 
increase by about 5-fold or greater, and most preferably an increase by 

30 about 10-fold or greater. 

The present invention further includes vectors comprising the 
disclosed MS4A sequences, including plasmids, cosmids, and viral vectors. 
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The term "vector", as used herein refers to a DNA molecule having 
sequences that enable its replication in a compatible host cell. A vector also 
includes nucleotide sequences to permit ligation of nucleotide sequences 
within the vector, wherein such nucleotide sequences are also replicated in a 
5 compatible host cell. A vector can also mediate recombinant production of a 
MS4A polypeptide, as described further herein below. Preferred vectors 
include but are not limited to pBluescript (Stratagene), pUC18, pBLCAT3 
(Luckow & Schutz (1987) Nucleic Acids Res 15:5490), pLNTK (Gorman et 
al. (1996) Immunity 5:241-252), and pBAD/glll (Stratagene). A preferred 

10 host cell is a mammalian cell; more preferably the cell is a Chinese hamster 
ovary cell, a HeLa cell, a baby hamster kidney cell, or a mouse cell; even 
more preferably the cell is a human cell. 

Nucleic acids of the present invention can be cloned, synthesized, 
recombinantly altered, mutagenized, or combinations thereof. Standard 

15 recombinant DNA and molecular cloning techniques used to isolate nucleic 
acids are well known in the art. Exemplary, non-limiting methods are 
described by Sambrook et al., eds. (1989); by Silhavy et al. (1984) 
Experiments with Gene Fusions , Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York; by Ausubel et al (1992) Current Protocols in 

20 Molecular Biology , John Wylie and Sons, Inc., New York, New York; and by 
Glover, ed. (1985) DNA Cloning: A Practical Approach , MRL Press, Ltd., 
Oxford, United Kingdom. Site-specific mutagenesis to create base pair 
changes, deletions, or small insertions are also well known in the art as 
exemplified by publications, see, e.g., Adelman et al., (1983) DNA 2:183; 

25 Sambrook et al. (1989). 

Sequences detected by methods of the invention can be detected, 
subcloned, sequenced, and further evaluated by any measure well known in 
the art using any method usually applied to the detection of a specific DNA 
sequence including but not limited to dideoxy sequencing, PCR, oligomer 

30 restriction (Saiki et al. (1985) Bio/Technology 3:1008-1012), allele-specific 
oligonucleotide (ASO) probe analysis (Conner et al. (1983) Proc Natl Acad 
Sci USA 80:278), and oligonucleotide ligation assays (OLAs) (Landgren et. 
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al. (1988) Science 241:1007). Molecular techniques for DNA analysis have 
been reviewed (Landgren et. al. (1988) Science 242:229-237). 
LB. MS4A Polypeptides 

The polypeptides provided by the present invention include the 
5 isolated polypeptides set forth as the even-numbered SEQ ID NOs:2-38, 
polypeptides substantially similar to the even-numbered SEQ ID NOs:2-38, 
MS4A polypeptide fragments, fusion proteins comprising MS4A amino acid 
sequences, biologically functional analogs, and polypeptides that cross-react 
with an antibody that specifically recognizes a MS4A polypeptide. 
10 The term "isolated", as used in the context of a polypeptide, indicates 

that the polypeptide exists apart from its native environment and is not a 
product of nature. An isolated polypeptide can exist in a purified form or can 
exist in a non-native environment such as, for example, in a transgenic host 
cell. 

15 The term "purified", when applied to a polypeptide, denotes that the 

polypeptide is essentially free of other cellular components with which it is 
associated in the natural state. Preferably, a polypeptide is a homogeneous 
solid or aqueous solution. Purity and homogeneity are typically determined 
using analytical chemistry techniques such as polyacrylamide gel 

20 electrophoresis or high performance liquid chromatography. A polypeptide 
which is the predominant species present in a preparation is substantially 
purified. The term "purified" denotes that a polypeptide gives rise to 
essentially one band in an electrophoretic gel. Particularly, it means that the 
polypeptide is at least about 50% pure, more preferably at least about 85% 

25 pure, and most preferably at least about 99% pure. 

The term "substantially identical" in the context of two or more 
polypeptides sequences is measured by (a) polypeptide sequences having 
about 35%, or 45%, or preferably from 45-55%, or more preferably 55-65%, 
or most preferably 65% or greater amino acids which are identical or 

30 functionally equivalent. Percent "identity" and methods for determining 
identity are defined herein below under the heading Nucleotide and Amino 
Acid Sequence Comparisons . 



WO 02/062946 



24 



PCT/US01/48437 



Substantially identical polypeptides also encompass two or more 
polypeptides sharing a conserved three-dimensional structure. 
Computational methods can be used to compare structural representations, 
and structural models can be generated and easily tuned to identify 
5 similarities around important active sites or ligand binding sites. See 
Henikoff et al. (2000) Electrophoresis 21 (9):1 700-1 706; Huang et al. (2000) 
Pac Symp Biocomput 230-241; Saqi et al. (1999) Bioinformatics 15(6):521- 
522; and Barton (1998) Acta Crystallogr D Biol Crystallogr 54:1 139-1 146. 

The term "functionally equivalent' in the context of amino acid 

10 sequences is well known in the art and is based on the relative similarity of 
the amino acid side-chain substituents. See Henikoff & Henikoff (2000) Adv 
Protein Chem 54:73-97. Relevant factors for consideration include side- 
chain hydrophobicity, hydrophilicity, charge, and size. For example, 
arginine, lysine, and histidine are all positively charged residues; that 

15 alanine, glycine, and serine are all of similar size; and that phenylalanine, 
tryptophan, and tyrosine all have a generally similar shape. By this analysis, 
described further herein below, arginine, lysine, and histidine; alanine, 
glycine, and serine; and phenylalanine, tryptophan, and tyrosine; are defined 
herein as biologically functional equivalents. 

20 In making biologically functional equivalent amino acid substitutions, 

the hydropathic index of amino acids can be considered. Each amino acid 
has been assigned a hydropathic index on the basis of their hydrophobicity 
and charge characteristics, these are: isoleucine (+ 4.5); valine (+ 4.2); 
leucine (+ 3.8); phenylalanine (+ 2.8); cysteine (+ 2.5); methionine (+ 1.9); 

25 alanine (+ 1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (- 
0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and 
arginine (-4.5). 

The importance of the hydropathic amino acid index in conferring 
30 interactive biological function on a protein is generally understood in the art 
(Kyte et al. (1982) J Mol Biol 157:105.). It is known that certain amino acids 
can be substituted for other amino acids having a similar hydropathic index 
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or score and still retain a similar biological activity. In making changes based 
upon the hydropathic index, the substitution of amino acids whose 
hydropathic indices are within ±2 of the original value is preferred, those 
which are within ±1 of the original value are particularly preferred, and those 
5 within ±0.5 of the original value are even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids 
can be made effectively on the basis of hydrophilicity. U.S. Patent No. 
4,554,101 states that the greatest local average hydrophilicity of a protein, 
as governed by the hydrophilicity of its adjacent amino acids, correlates with 
10 its immunogenicity and antigenicity, i.e. with a biological property of the 
protein. It is understood that an amino acid can be substituted for another 
having a similar hydrophilicity value and still obtain a biologically equivalent 
protein. 

As detailed in U.S. Patent No. 4,554,101, the following hydrophilicity 
15 values have been assigned to amino acid residues: arginine (+ 3.0); lysine 
(+ 3.0); aspartate (+ 3.0+1); glutamate (+ 3.0±1); serine (+ 0.3); asparagine 
(+ 0.2); glutamine (+ 0.2); glycine (0); threonine (-0.4); proline (-0.5±1); 
alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); 
leucine (-1 .8); isoleucine (-1 .8); tyrosine (-2.3); phenylalanine (-2.5); 
20 tryptophan (-3.4) . 

In making changes based upon similar hydrophilicity values, the 
substitution of amino acids whose hydrophilicity values are within ±2 of the 
original value is preferred, those which are within ±1 of the original value are 
particularly preferred, and those within ±0.5 of the original value are even 
25 more particularly preferred. 

The present invention also encompasses MS4A polypeptide 
fragments or functional portions of a MS4A polypeptide. Such functional 
portion need not comprise all or substantially all of the amino acid sequence 
of a native MS4A gene product. The term "functional" includes any 
30 biological activity or feature of MS4A, including immunogenicity. 

The present invention also includes longer sequences of a MS4A 
polypeptide, or portion thereof. For example, one or more amino acids can 
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be added to the N-terminus or C-terminus of a MS4A polypeptide. Fusion 
proteins comprising MS4A polypeptide sequences are also provided within 
the scope of the present invention. Methods of preparing such proteins are 
known in the art. 

5 The present invention also encompasses functional analogs of a 

MS4A polypeptide. Functional analogs share at least one biological function 
with a MS4A polypeptide. An exemplary function is immunogenicity. In the 
context of amino acid sequence, biologically functional analogs, as used 
herein, are peptides in which certain, but not most or all, of the amino acids 

10 can be substituted. Functional analogs can be created at the level of the 
corresponding nucleic acid molecule, altering such sequence to encode 
desired amino acid changes. In one embodiment, changes can be 
introduced to improve the antigenicity of the protein. In another 
embodiment, a MS4A polypeptide sequence is varied so as to assess the 

1 5 activity of a mutant MS4A polypeptide. 

The present invention also encompasses recombinant production of 
the disclosed MS4A polypeptides. Briefly, a nucleic acid sequence encoding 
a MS4A polypeptide, or portion thereof, is cloned into a expression cassette, 
the cassette is introduced into a host organism, where it is recombinantly 

20 produced. 

The term "expression cassette" as used herein Tneans a DNA 
sequence capable of directing expression of a particular nucleotide 
sequence in an appropriate host cell, comprising a promoter operably linked 
to the nucleotide sequence of interest which is operably linked to termination 

25 signals. It also typically comprises sequences required for proper translation 
of the nucleotide sequence. The expression cassette comprising the 
nucleotide sequence of interest can be chimeric. The expression cassette 
can also be one which is naturally occurring but has been obtained in a 
recombinant form useful for heterologous expression. 

30 The expression of the nucleotide sequence in the expression cassette 

can be under the control of a constitutive promoter or an inducible promoter 
which initiates transcription only when the host cell is exposed to some 



WO 02/062946 



27 



PCT/US01/48437 



particular external stimulus. Exemplary promoters include Simian virus 40 
early promoter, a long terminal repeat promoter from retrovirus, an action 
promoter, a heat shock promoter, and a metallothien protein. In the case of 
a multicellular organism, the promoter and promoter region can direct 
5 expression to a particular tissue or organ or stage of development. 
Exemplary tissue-specific promoter regions include a MS4A promoter, 
described herein. Suitable expression vectors which can be used include, 
but are not limited to, the following vectors or their derivatives: human or 
animal viruses such as vaccinia virus or adenovirus, yeast vectors, 
10 bacteriophage vectors (e.g., lambda phage), and plasmid and cosmids DNA 
vectors. 

The term "host cell", as used herein, refers to a cell into which a 
heterologous nucleic acid molecule has been introduced. Transformed cells, 
tissues, or organisms are understood to encompass not only the end product 
15 of a transformation process, but also transgenic progeny thereof. 

A host cell strain can be chosen which modulates the expression of 
the inserted sequences, or modifies and processes the gene product in the 
specific fashion desired. For example, different host cells have 
characteristic and specific mechanisms for the translational and post- 
20 transactional processing and modification (e.g., glycosylation, 
phosphorylation of proteins). Appropriate cell lines or host systems can be 
chosen to ensure the desired modification and processing of the foreign 
protein expressed. Expression in a bacterial system can be used to produce 
a non-glycosylated core protein product. Expression in yeast will produce a 
25 glycosylated product. Expression in animal cells can be used to ensure 
"native" glycosylation of a heterologous protein. 

Expression constructs are transfected into a host cell by any standard 
method, including electroporation, calcium phosphate precipitation, DEAE- 
Dextran transfection, liposome-mediated transfection, and infection using a 
30 retrovirus. The MS4A-encoding nucleotide sequence carried in the 
expression construct can be stably integrated into the genome of the host or 
it can be present as an extrachromosomal molecule. 
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Isolated polypeptides and recombinantly produced polypeptides can 
be purified and characterized using a variety of standard techniques that are 
well known to the skilled artisan. See, e.g. Ausubel et al. (1992), 
Bodanszky, et al. (1976) Peptide Synthesis , John Wiley and Sons, Second 
5 Edition, New York, New York and Zimmer et al. (1993) Peptides, pp. 393- 
394, ESCOM Science Publishers, B. V. 

I.C. Nucleotide and Amino Acid Sequence Comparisons 
The terms "identical" or percent "identity" in the context of two or more 
nucleotide or polypeptide sequences, refer to two or more sequences or 

10 subsequences that are the same or have a specified percentage of amino 
acid residues or nucleotides that are the same, when compared and aligned 
for maximum correspondence, as measured using one of the sequence 
comparison algorithms disclosed herein or by visual inspection.. 

The term "substantially identical" in regards to a nucleotide or 

15 polypeptide sequence means that a particular sequence varies from the 
sequence of a naturally occurring sequence by one or more deletions, 
substitutions, or additions, the net effect of which is to retain at least some of 
biological activity of the natural gene, gene product, or sequence. Such 
sequences include "mutant" sequences, or sequences wherein the biological 

20 activity is altered to some degree but retains at least some of the original 
biological activity. The term "naturally occurring", as used herein, is used to 
describe a composition that can be found in nature as distinct from being 
artificially produced by man. For example, a protein or nucleotide sequence 
present in an organism, which can be isolated from a source in nature and 

25 which has not been intentionally modified by man in the laboratory, is 
naturally occurring. 

For sequence comparison, typically one sequence acts as a 
reference sequence to which test sequences are compared. When using a 
sequence comparison algorithm, test and reference sequences are entered 

30 into a computer program, subsequence coordinates are designated if 
necessary, and sequence algorithm program parameters are selected. The 
sequence comparison algorithm then calculates the percent sequence 
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identity for the designated test sequence(s) relative to the reference 
sequence, based on the selected program parameters. 

Optimal alignment of sequences for comparison can be conducted, 
e.g., by the local homology algorithm of Smith & Waterman (1981) Adv Appl 
5 Math 2:482, by the homology alignment algorithm of Needleman & Wunsch 
(1970) J Mol Biol 48:443, by the search for similarity method of Pearson & 
Lipman (1988) Proc Natl Acad Sci USA 85:2444-2448, by computerized 
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA 
in the Wisconsin Genetics Software Package, Genetics Computer Group, 

10 Madison, Wl), or by visual inspection. See generally, Ausubel et al., 1992. 

A preferred algorithm for determining percent sequence identity and 
sequence similarity is the BLAST algorithm, which is described in Altschul et 
al. (1990) J Mol Biol 21 5: 403-410. Software for performing BLAST analyses 
is publicly available through the National Center for Biotechnology 

15 Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short words of 
length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score 

20 threshold. These initial neighborhood word hits act as seeds for initiating 
searches to find longer HSPs containing them. The word hits are then 
extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are 
calculated using, for nucleotide sequences, the parameters M (reward score 

25 for a pair of matching residues; always > 0) and N (penalty score for 
mismatching residues; always < 0). For amino acid sequences, a scoring 
matrix is used to calculate the cumulative score. Extension of the word hits 
in each direction are halted when the cumulative alignment score falls off by 
the quantity X from its maximum achieved value, the cumulative score goes 

30 to zero or below due to the accumulation of one or more negative-scoring 
residue alignments, or the end of either sequence is reached. The BLAST 
algorithm parameters W, T, and X determine the sensitivity and speed of the 
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alignment. The BLASTN program (for nucleotide sequences) uses as 
defaults a wordlength W=11, an expectation E=10, a cutoff of 100, M=5, N=- 
4, and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) 
5 of 10, and the BLOSUM62 scoring matrix. See Henikoff & Henikoff (1989) 
Proc Natl Acad Sci USA 89:1 091 5. 

In addition to calculating percent sequence identity, the BLAST 
algorithm also performs a statistical analysis of the similarity between two 
sequences. See, e.g., Karlin and Altschul (1993) Proc Natl Acad Sci USA 

10 90:5873-5887. One measure of similarity provided by the BLAST algorithm 
is the smallest sum probability (P(N)), which provides an indication of the 
probability by which a match between two nucleotide or amino acid 
sequences would occur by chance. For example, a test nucleic acid 
sequence is considered similar to a reference sequence if the smallest sum 

1 5 probability in a comparison of the test nucleic acid sequence to the reference 
nucleic acid sequence is less than about 0.1, more preferably less than 
about 0.01 , and most preferably less than about 0.001 . 
I.D. Antibodies 

The present invention also provides an antibody that specifically binds 
20 a MS4A polypeptide. The term "antibody" indicates an immunoglobulin 
protein, or functional portion thereof, including a polyclonal antibody, a 
monoclonal antibody, a chimeric antibody, a single chain antibody, Fab 
fragments, and an Fab expression library. "Functional portion" refers to the 
part of the protein that binds a molecule of interest. In a preferred 
25 embodiment, an antibody of the invention is a monoclonal antibody. 
Techniques for preparing and characterizing antibodies are well known in 
the art ( See , e.g., Harlow & Lane (1988) Antibodies: A Laboratory Manual . 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York). A 
monoclonal antibody of the present invention can be readily prepared 
30 through use of well-known techniques such as the hybridoma techniques 
exemplified in U.S. Patent No 4,196,265 and the phage-displayed 
techniques disclosed in U.S. Patent No. 5,260,203. 
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The phrase "specifically (or selectively) binds to an antibody", or 
"specifically (or selectively) immunoreactive with", when referring to a protein 
or peptide, refers to a binding reaction which is determinative of the 
presence of the protein in a heterogeneous population of proteins and other 
5 biological materials. Thus, under designated immunoassay conditions, the 
specified antibodies bind to a particular protein and do not show significant 
binding to other proteins present in the sample. Specific binding to an 
antibody under such conditions can require an antibody that is selected for 
its specificity for a particular protein. For example, antibodies raised to a 

10 protein with an amino acid sequence encoded by any of the nucleic acid 
sequences of the invention can be selected to obtain antibodies specifically 
immunoreactive with that protein and not with unrelated proteins. 

The use of a molecular cloning approach to generate antibodies, 
particularly monoclonal antibodies, t and more particularly single chain 

15 monoclonal antibodies, are also provided. The production of single chain 
antibodies has been described in the art. See , e.g., U.S. Patent No. 
5,260,203. For this approach, combinatorial immunoglobulin phagemid 
libraries are prepared from RNA isolated from the spleen of the immunized 
animal, and phagemids expressing appropriate antibodies are selected by 

20 panning on endothelial tissue. The advantages of this approach over 
conventional hybridoma techniques are that approximately 10 4 times as 
many antibodies can be produced and screened in a single round, and that 
new specificities are generated by heavy (H) and light (L) chain 
combinations in a single chain, which further increases the chance of finding 

25 appropriate antibodies. Thus, an antibody of the present invention, or a 
"derivative" of an antibody of the present invention, pertains to a single 
polypeptide chain binding molecule which has binding specificity and affinity 
substantially similar to the binding specificity and affinity of the light and 
heavy chain aggregate variable region of an antibody described herein. 

30 The term "immunochemical reaction", as used herein, refers to any of 

a variety of immunoassay formats used to detect antibodies specifically 
bound to a particular protein, including but not limited to competitive and 
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non-competitive assay systems using techniques such as 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoradiometric assays, gel diffusion 
precipitation reactions, immunodiffusion assays, in situ immunoassays (e.g., 
5 using colloidal gold, enzyme or radioisotope labels), western blots, 
precipitation reactions, agglutination assays (e.g., gel agglutination assays, 
hemagglutination assays), complement fixation assays, immunofluorescence 
assays, protein A assays, and Immunoelectrophoresis assays, etc. See 
Harlow & Lane (1988) for a description of immunoassay formats and 
10 conditions. 

I.E. Protein Binding Assays 

The term "binding" refers to an affinity between two molecules, for 
example, a ligand and a receptor. As used herein, "binding" means a 
preferential binding of one molecule for another in a mixture of molecules. 

15 The binding of the molecules can be considered specific if the binding affinity 
is about 1 x 10 4 IVT 1 to about 1 x 10 6 M" 1 or greater. Binding of two 
molecules also encompasses a quality or state of mutual action such that an 
activity of one protein or compound on another protein is inhibitory (in the 
case of an antagonist) or enhancing (in the case of an agonist). Exemplary 

20 protein binding assays include but are not limited to Fluorescence 
Correlation Spectroscopy (FCS), Surface-Enhanced Laser 
Desorption/lonization time-of-flight mass spectrometry (SELDI-TOF), and 
Biacore, each described further herein below. 

Fluorescence Correlation Spectroscopy (FCS) measures the average 

25 diffusion rate of a fluorescent molecule within a small sample volume 
(Madge et al. (1972) Phys Rev Lett 29:705-708; Maiti et al. (1997) Proc Natl 
Acad Sci USA, 94:11753-11757). The sample size can be as low as 10 3 
fluorescent molecules and the sample volume as low as the cytoplasm of a 
single bacterium. The diffusion rate is a function of the mass of the molecule 

30 and decreases as the mass increases. FCS can therefore be applied to 
protein-ligand interaction analysis by measuring the change in mass and 
therefore in diffusion rate of a molecule upon binding. In a typical 
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experiment, the target to be analyzed is expressed as a recombinant protein 
with a sequence tag, such as a poly-histidine sequence, inserted at the N- 
terminus or C-terminus. The expression takes place in E. co//, yeast or 
mammalian cells. The protein is purified using chromatographic methods. 
5 For example, the poly-histidine tag can be used to bind the expressed 
protein to a metal chelate column such as Ni 2+ chelated on iminodiacetic 
acid agarose. The protein is then labeled with a fluorescent tag such as 
carboxytetramethylrhodamine or BODIPY™ (Molecular Probes, Eugene, 
Oregon). The protein is then exposed in solution to the potential ligand, and 

10 its diffusion rate is determined by FCS using instrumentation available from 
Carl Zeiss, Inc. (Thornwood, New York). Ligand binding is determined by 
changes in the diffusion rate of the protein. 

. Surface-Enhanced Laser Desorption/lonization (SELDI) was 
developed by Hutchens & Yip (1993) Rapid Commun Mass Spectrom 7:576- 

15 580). When coupled to a time-of-flight mass spectrometer (TOF), SELDI 
provides a means to rapidly analyze molecules retained on a chip. It can be 
applied to ligand-protein interaction analysis by covalently binding the target 
protein on the chip and analyzing by MS the small molecules that bind to this 
protein (Worrall et al. (1998) Anal Biochem 70:750-756). In a typical 

20 experiment, the target to be analyzed , is expressed as described for FCS. 
The purified protein is then used in the assay without further preparation. It 
is bound to the SELDI chip either by utilizing the poly-histidine tag or by 
other interaction such as ion exchange or hydrophobic interaction. The chip 
thus prepared is then exposed to the potential ligand via, for example, a 

25 delivery system able to pipet the ligands in a sequential manner 
(autosampler). The chip is then washed in solutions of increasing 
stringency, for example a series of washes with buffer solutions containing 
an increasing ionic strength. After each wash, the bound material is 
analyzed by submitting the chip to SELDI-TOF. Ligands that specifically 

30 bind the target are identified by the stringency of the wash needed to elute 
them. 



WO 02/062946 



34 



PCT/US01/48437 



Biacore relies on changes in the refractive index at the surface layer 
upon binding of a ligand to a protein immobilized on the layer. In this 
system, a collection of small ligands is injected sequentially in a 2-5 
microliter cell, wherein the protein is immobilized within the cell. Binding is 
5 detected by surface plasmon resonance (SPR) by recording laser light 
refracting from the surface. In general, the refractive index change for a 
given change of mass concentration at the surface layer is practically the 
same for all proteins and peptides, allowing a single method to be applicable 
for any protein (Liedberg et al. (1983) Sensors Actuators 4:299-304; 

10 Malmquist (1993) Nature 361:186-187). In a typical experiment, the target to 
be analyzed is expressed as described for FCS. The purified protein is then 
used in the assay without further preparation. It is bound to the Biacore chip 
either by utilizing the poly-histidine tag or by other interaction such as ion 
exchange or hydrophobic interaction. The chip thus prepared is then 

15 exposed to the potential ligand via the delivery system incorporated in the 
instruments sold by Biacore (Uppsala, Sweden) to pipet the ligands in a 
sequential manner (autosampler). The SPR signal on the chip is recorded 
and changes in the refractive index indicate an interaction between the 
immobilized target and the ligand. Analysis of the signal kinetics of on rate 

20 and off rate allows the discrimination between non-specific and specific 
interaction. 

I.F. Transgenic animals 

It is also within the scope of the present invention to prepare a 
transgenic animal to mutagenize the MS4A locus or to express a transgene 

25 comprising nucleic acid sequences of the present invention. The term 
"transgenic animal", indicates an animal comprising a germline insertion of a 
heterologous nucleic acid. Transgenic animals of the present invention are 
understood to encompass not only the end product of a transformation 
method, but also transgenic progeny thereof. 

30 The term "transgene", as used herein indicates a heterologous 

nucleic acid molecule that has been transformed into a host cell. For 
intended use in the creation of a transgenic animal, the transgene includes 
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genomic sequences of the host organism at a selected locus or site of 
transgene integration to mediate a homologous recombination event. A 
transgene further comprises nucleic acid sequences of interest, for example 
a targeted modification of the gene residing within the locus, a reporter gene, 
5 or a expression cassette, each defined herein above. 

Transgene integration can be used to create gene mutations, 
including "knock-out", "knock-in", or a "knock-down" mutations. 
Representative approaches are disclosed in the Examples presented below. 
The term "knock-out" refers to a homologous recombination event that 

10 renders a gene inactive. Gene knock-out is generally accomplished by 
integration of the transgene at a chromosomal loci, thereby interrupting a 
gene residing at that loci. The term "knock-in" refers to in vivo replacement 
at a targeted locus. Knock-in mutations can modify a gene sequence to 
create a loss-of-f unction or gain-of-function mutation. The term "gene 

15 knock-down" refers to a homologous recombination event wherein the 
transgene partially eliminates gene function. A knock-down animal can be 
created by transgenic expression of an antisense molecule, wherein a 
transgene comprising the antisense sequence and a relevant promoter are 
integrated into the genome at a non-essential loci. Expression of the 

20 antisense or ribozyme molecule disrupts the corresponding gene function, 
although this disruption is generally incomplete (Luyckx et al. (1999) Proc 
Natl Acad Sci USA 96(21):12174-12179). 

Conditional mutation can be accomplished using transgenic methods 
in combination with the Cre-recombinase system in mice. Briefly, in one 

25 instance, a transgenic mouse is derived that expresses Cre-recombinase 
under the direction of an inducible promoter. A second transgenic mouse 
bears a mutation of a gene of interest as well as a lox-P-flanked endogenous 
gene sequence. Such transgenic mice are mated, the resulting progeny 
having both the Cre-recombinase and lox-P-flanked transgenes. Induction 

30 of Cre recombinase catalyzes excision of the lox-P-flanked transgene, 
thereby excising a portion of the endogenous gene sequence and revealing 
the mutated sequence. Conditional knockout can be varied according to the 
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temporal and spatial features of Cre recombinase expression, inherent in the 
selection of a promoter to drive Cre recombinase. See Postic et al. (1999) J 
Biol C/?e/77 275(1):305-315; and Sauer (1998) Methods 14(4):381-392. 

Transgenes can also be used for heterologous expression in a host 
5 organism without generating phenotypically apparent mutations. By this 
method, nucleotide sequences of interest are introduced into the genome at 
a nonessential loci, whereby insertion alone does not disrupt an essential 
gene function. Optionally, expression of the transgene can generate a gain- 
of-function or ectopic function phenotype. 

10 Techniques for the preparation of transgenic animals are known in the 

art. Exemplary techniques are described in U.S. Patent No. 5,489,742 
(transgenic rats); U.S. Patent Nos. 4,736,866, 5,550,316, 5,614,396, 
5,625,125 and 5,648,061 (transgenic mice); U.S. Patent No. 5,573,933 
(transgenic pigs); 5,162,215 (transgenic avian species) and U.S. Patent No. 

15 5,741,957 (transgenic bovine species). Briefly, nucleotide sequences of 
interest are cloned into a vector, and the construct is transformed into a 
germ cell. In the germ cell, a chromosomal rearrangement event takes 
place wherein the nucleic acid sequences of interest are integrated into the 
genome of the germ cell by homologous recombination. Fertilization and 

20 propagation of the transformed germ cell results in a transgenic animal. 
Homozygosity of the mutation is accomplished by intercrossing. 
I.G. Therapeutic Methods 

The present invention further provides methods for discovering 
substances that can be used as pharmaceutical compositions. The term 
25 "pharmaceutical composition" or "drug" as used herein, each refer to any 
substance having a biological activity. Substances discovered by methods 
of the present invention include but are not limited to polypeptide, proteins, 
peptides, chemical compounds, and antibodies. 

A composition of the present invention is typically formulated using 
30 acceptable vehicles, adjuvants, and carriers as desired. 

Among the acceptable vehicles and solvents that can be employed 
are water, Ringer's solution, and isotonic sodium chloride solution. In 
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addition, sterile, fixed oils are conventionally employed as a solvent or 
suspending medium. For this purpose any bland fixed oil can be employed 
including synthetic mono- or di-glycerides. In addition, fatty acids such as 
oleic acid find use in the preparation of injectable compositions. 
5 Injectable preparations, for example sterile injectable aqueous or 

oleaginous suspensions, are formulated according to the known art using 
suitable dispersing or wetting agents and suspending agents. The sterile 
injectable preparation can also be a sterile injectable solution or suspension 
in a nontoxic diluent or solvent, for example 1 ,3-butanediol. 

10 A vector can be used as a carrier, for example an adenovirus vector, 

can be used for gene therapy methods. The vector is purified to sufficiently 
render it essentially free of undesirable contaminants, such as defective 
interfering adenovirus particles or endotoxins and other pyrogens such that it 
does not cause any untoward reactions in the individual receiving the vector 

15 construct. A preferred means of purifying the vector involves the use of 
buoyant density gradients, such as cesium chloride gradient centrifugation. 

A transfected cell can also serve as a carrier. By way of example, a 
liver cell can be removed from an organism, transfected with a nucleic acid 
sequence of the present invention using methods set forth above and then 

20 the transfected cell returned to the organism (e.g. injected intra-vascularly). 

Monoclonal antibodies or polypeptides of the invention can be 
administered parenterally by injection or by gradual infusion over time. 
Although the tissue to be treated can typically be accessed in the body by 
systemic administration and therefore most often treated by intravenous 

25 administration of therapeutic compositions, other tissues and delivery means 
are provided where there is a likelihood that the tissue targeted contains the 
target molecule and are known to those of skill in the art. 

Representative antibodies for use in the present invention are intact 
immunoglobulin molecules, substantially intact immunoglobulin molecules, 

30 single chain immunoglobulins or antibodies, those portions of an 
immunoglobulin molecule that contain the paratope, including antibody 
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fragments. It is within the scope of the present invention that a monovalent 
modulator can optionally be used. 

Methods of preparing "humanized" antibodies are generally well 
known in the art, and can readily be applied to the antibodies of the present 
5 invention. Humanized monoclonal antibodies offer particular advantages 
over monoclonal antibodies derived from other mammals, particularly insofar 
as they can be used therapeutically in humans. Specifically, humanized 
antibodies are not cleared from the circulation as rapidly as "foreign" 
antigens, and do not activate the immune system in the same manner as 

10 foreign antigens and foreign antibodies. 

With respect to the therapeutic methods of the present invention, a 
preferred subject is a vertebrate subject. A preferred vertebrate is warm- 
blooded; a preferred warm-blooded vertebrate is a mammal. A preferred 
mammal is a mouse or, most preferably, a human. As used herein and in 

15 the claims, the term "patient" includes both human and animal patients. 
Thus, veterinary therapeutic uses are provided in accordance with the 
present invention. 

Also provided is the treatment of mammals such as humans, as well 
as those mammals of importance due to being endangered, such as 

20 Siberian tigers; of economical importance, such as animals raised on farms 
for consumption by humans; and/or animals of social importance to humans, 
such as animals kept as pets or in zoos. Examples of such animals include 
but are not limited to: carnivores such as cats and dogs; swine, including 
pigs, hogs, and wild boars; ruminants and/or ungulates such as cattle, oxen, 

25 sheep, giraffes, deer, goats, bison, and camels; and horses. Also provided 
is the treatment of birds, including the treatment of those kinds of birds that 
are endangered and/or kept in zoos, as well as fowl, and more particularly 
domesticated fowl, i.e., poultry, such as turkeys, chickens, ducks, geese, 
guinea fowl, and the like, as they are also of economical importance to 

30 humans. Thus, provided is the treatment of livestock, including, but not 
limited to, domesticated swine, ruminants, ungulates, horses, poultry, and 
the like. 
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As used herein, the term "experimental subject" refers to any subject 
or sample in which the desired measurement is unknown. The term "control 
subject" refers to any subject or sample in which a desired measure is 
unknown. 

5 As used herein, an "effective" dose refers to a dose(s) administered to 

an individual patient sufficient to cause a change in MS4A activity. One of 
ordinary skill in the art can tailor the dosages to an individual patient, taking 
into account the particular formulation and method of administration to be 
used with the composition as well as patient height, weight, severity of 

10 symptoms, and stage of the biological condition to be treated. Such 
adjustments or variations, as well as evaluation of when and how to make 
such adjustments or variations, are well known to those of ordinary skill in 
the art of medicine. 

A therapeutically effective amount can comprise a range of amounts. 

15 One skilled in the art can readily assess the potency and efficacy of a MS4A 
modulator of this invention and adjust the therapeutic regimen accordingly. 
A modulator of MS4A biological activity can be evaluated by a variety of 
means including the use of a responsive reporter gene, interaction of MS4A 
polypeptides with a monoclonal antibody, analysis of cell subpopulations, 

20 and measurement of [Ca 2+ ]j levels, each technique described herein. 

Additional formulation and dose techniques have been described in 
the art, see for example, those described in U.S. Patent Nos. 5,326,902 and 
5,234,933, and International Publication No. WO 93/25521. 

For the purposes described above, the identified substances can 

25 normally be administered systemically, parenterally, or orally. The term 
"parenteral" as used herein includes intravenous, intra-muscular, intra- 
arterial injection, or infusion techniques. Other compositions for 
administration include liquids for external use, and endermic liniments 
(ointment, etc.), suppositories, and pessaries which comprise one or more of 

30 the active substance(s) and can be prepared by known methods. 
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IL CD20 Gene Family Members 

II.A. Identification of CD20 Gene Family Members 
The present invention provides MS4A nucleic acid and polypeptide 
sequences. Preferably, a MS4A gene comprises the sequence set forth as 
5 any one of the odd-numbered SEQ ID NOs:1-37, a nucleic acid molecule 
that is substantially similar to any one of the odd-numbered SEQ ID NOs:1- 
37, or a nucleic acid molecule comprising a 20 base pair nucleotide 
sequence that is identical to a contiguous 20 base pair sequence of any one 
of the odd-numbered SEQ ID NOs:1-37. 

10 To identify new CD20 gene family members, the human and mouse 

CD20 amino acid sequences (Tedder et al., 1988a; Tedder et al., 1988b) 
were used to search the translated GenBank databases, including 
expressed sequence tags, using the BLAST program (Altschul et al., 1997). 
Among 337 homologous sequences identified, at least 17 novel genes 

15 expressed by mouse, human, and pig had predicted amino acid sequences 
homologous to CD20. Complete coding regions were predicted using 
overlapping nucleotide sequences obtained from sequenced ESTs and 
cDNAs that corresponded to unique, near full-length transcripts in humans 
and mice (Figure 1). All nucleotide sequences were verified by sequencing 

20 multiple near full-length cDNAs isolated by applicants and 40 cDNAs 
obtained from the ATCC (American Tissue Culture Collection, Bethesda, 
Maryland, USA). In addition, a pig cDNA and its human counterpart 
homologous to CD20 were identified as GenBank submissions AJ236932.1 
and AK000224, respectively. In total, unique cDNA clones were identified 

25 that encode at least 16 distinct full-length CD20-like proteins. 

Complete cDNA sequences encoding the human and mouse MS4A 
family members (MS4A1, -A2, -A3, -AAA, -A5, -A6A, -A7, -A8B and -A12) 
were also used to search the GenBank human genomic database (htgs; 
http://www.ncbi.nlm.nih.gov/blast/) using the BLAST program (Altschul et al., 

30 1997), as further described in Example 2. Two-hundred-twenty different 
contigs or distinct genomic DNA sequences were identified in the database 
of unfinished human genomic sequences that were either identical or similar 
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to MS4A family members. These sequences were predominantly derived 
from sixteen partially sequenced bacterial artificial chromosomes (BACs) 
that spanned 400-500 kb of human chromosome 11q12 (Table 1). Based on 
known cDNA sequences of MS4A family members, we were able to order 
5 and arrange these genomic sequences into overlapping continuous DNA 
segments. Since many of the contigs identified were overlapping, it was 
thereby possible to assemble long DNA sequences that encoded entire 
MS4A genes or portions of MS4A genes. Gaps between exon encoding 
DNA sequences were filled in many cases by additional sequence homology 

1 0 searches using DNA sequences found at the ends of gaps. When sequence 
differences were observed between different overlapping DNA fragments, 
consensus sequences were used or PGR primers were generated, that 
portion of genomic DNA was then amplified and sequenced to resolve 
ambiguous sequences. 

15 BLAST analysis of the htgs phase 1 or phase 2 human genomic DNA 

sequences encoding MS4A cDNAs and the assembled and annotated 
human genomic sequence thereof, as disclosed herein, revealed the 
presence of each known human MS4A family member. In addition, three 
putative genes encoding unique MS4A family members were identified that 

20 localized to the q1 2-1 3.1 region of human chromosome 11. Complete 
coding regions were predicted using overlapping nucleotide sequences 
obtained from sequenced ESTs and cDNAs and by comparison of gene 
structure, described further herein below (Figure 2). 

By identifying sequences that correlated with different MS4A genes in 

25 each BAG (Table 1), and by the assembly of minimal genomic DNA lengths 
that could encode each MS4A gene (Figure 2), we used the overlapping 
BACs to identify the order of the MS4A genes on chromosome 1 1 q1 2 
(Figure 6). This analysis also allowed us to determine the direction of gene 
transcription for most MS4A genes. Furthermore, the MS4A cDNA 

30 sequences, disclosed herein, were used to assemble genomic clones set 
forth as SEQ ID NOs:73-81. In some cases, multiple MS4A genes could be 
aligned within a continuous genomic sequence. For example, the genomic 
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sequence set forth as SEQ ID NO:77 comprises both the MS4A4E and 
MS4A6A genes. Similarly, the genomic region set forth as SEQ ID NO:79 
comprises three MS4A genes: MS4A7, MS4A5, and MS4A12. 

The MS4A4E gene encodes 660 bp of translated sequence (Figure 
5 3), contained within at least seven exons (Figure 2). Exons were identified 
based on their sequence similarities with MS4A4A sequences and the 
identification of canonical splice-donor and -acceptor sites (Aebi & 
Weissmann, 1987). The MS4A4E gene sequence was at least 23,379 base 
pairs in length, if counted from the putative translation initiation ATG site until 

10 the TGA translation termination stop site (Figure 2). An exon encoding the 
putative 5 f untranslated region of MS4A4E, was highly homologous with the 
corresponding sequence in MS4A4A cDNAs (disclosed herein). This 
sequence homology extended for >7 kbp upstream from this putative exon 
and also included upstream repetitive Alu elements. Representative 

15 upstream homologous sequences are shown in Figure 3. Similar sequence 
homologies were identified in the 3' untranslated regions of MS4A4E and 
MS4A4A, which extended beyond the poly-A attachment signal sequences 
(Figure 3). Based on the sequence similarities in translated and 
untranslated exons, it appears that the MS4A4E and MS4A4A genes 

20 resulted from a recent gene duplication event. 

The MS4A6E gene encodes 441 bp of translated sequence (Figure 
4), contained within at least four exons (Figure 2). Exons were identified 
based on their sequence similarities with MS4A6A cDNA sequences and the 
identification of canonical splice-donor and -acceptor sites (Aebi & 

25 Weissmann, 1987). In addition, the predicted gene sequences matched 
those found in three cDNA clones that were sequenced (ATCC Nos. 
3704466, 1852248 and 3557769). The MS4A6E gene was at least 5,060 bp 
in length, if counted from the putative translation initiation ATG site until the 
TGA translation termination codon (Figure 2). The MS4A6E gene lacks 

30 exons that encode the first two membrane spanning domains present in 
most MS4A family proteins (Figures 2 and 7). An exon homologous with the 
5' untranslated region of MS4A6A cDNAs was not identified within 7,629 bp 
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of sequence upstream of the exon encoding the translation initiation site of 
MS4A6E. However, there was a canonical 3' splice region upstream of the 
ATG initiation codon located at identical positions in the MS4A6E and 
MS4A6A genes. Similar sequence homologies were identified in the 3' 
5 untranslated regions of MS4A6E and MS4A6A that extend beyond the 
sequence shown in Figure 4. Based on the sequence similarities in 
translated and untranslated exons, it appears that the MS4A6E and 
MS4A6A genes represent a recent gene duplication event, although several 
exons encoding translated sequence were lost in the MS4A6E gene (Figure 
10 2). 

The MS4A10 gene encodes 726 bp of translated sequence (Figure 
5), contained within at least six exons (Figure 2). Exons were identified 
based on their sequence similarities with mouse MS4a10 cDNA sequences 
and the identification of canonical splice-donor and -acceptor sites (Aebi & 

15 Weissmann, 1987). The MS4A10 gene was at least 8,183 bp in length if 
counted from the putative translation initiation ATG site until the TGA 
translation termination stop site (Figure 2). An exon homologous with the 5' 
untranslated region of mouse MS4a10 cDNAs was not identified within 8,829 
bp of sequence upstream of the exon encoding the translation initiation site 

20 of MS4A10. However, there was a canonical 3' splice region upstream of 
the ATG initiation codon located at identical positions in the MS4A10 and 
MS4a10 genes. Modest sequence homologies were identified in the 3' 
untranslated regions of MS4A10 and MS4a10 (Figure 5). 
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Table 1 

Human BACs Containing MS4A Genes 



BAC 


Accession No. 3 
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A6A A2 A3 
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UI IM IUWI 1 


A4A A5 A6E A7 

AA"AA , A\vJ , nuu, #»• 


RP11-652L5 


a on h once 




A4A A4E A6A 


I— x 1 — v A A J i n K 1 o 

RP1 1-448N3 


a orvo /i nee 


1 1 
I I 


A8B 


RP11-312N17 


A 

agoza oyy 


1 1 
I I 


A8B A10 

t\\J U, n Iv 


RP1 1-196E16 


A PA07707 

AC027787 


I O 


A^ A1 


CMB9-79B2 


A nAAA7 /I O 

AP00074o 


I I L]Zo 


aid 


RP11-804A23 


a nnnr\777 

AP00U7/ / 


I I 


aid 

AA 1 \J 


RP1 1-736110 


ArUUUf yU 


I I Lj I <L 


A3 

AA\J 


RP1 1-804B24 


ArUUUyo4 


1 1 
I I 


A10 

i \ i vy 


RP11-729B4 


AP001034 


11q12 


A5, A12, A1 


CMB9-2M23 


AP001181 


11q12 


A2, A3 


CMB9-100I1 


AP001257 


11q12 


A6A, A4E 


CMB9-49F18 


AP001259 


11 


A8B 


RP11-68H20 


AP001986 


11q 


A10 



a GenBank Accession number for the indicated BAC. 
5 indicates the MS4A gene sequences that mapped to each BAC. 
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ll.B. MS4A Nomenclature 

In collaboration with the Human Gene Nomenclature Committee 
( www.gene.ucl.ac.uk/nomenclature/) , this gene family was designated as the 
MS4A family (Membrane Spanning 4-domain family, subfamily A). The MS4 
5 designation is to accommodate the future identification of genes encoding 
proteins with a similar structure, yet with unresolved functions. Subfamily A 
will designate the CD20 family. Using this nomenclature, the CD20 gene 
was designated as MS4A1, FceRlfi as MS4A2, and HTm4 as MS4A3. 
Among the 16 novel genes identified, 8 human genes were named MS4A4A, 

10 MS4A4E, MS4A5, MS4A6A, MS4A6E, MS4A7, MS4A8B, and MS4A12. A 
ninth gene encoded a protein homologous with the single member of the 
mouse MS4a10 subfamily. This gene was tentatively designated as 
MS4A10. The remaining genes were of mouse or pig origin and were 
therefore labeled as MS4a3-MS4a12 based on the nomenclature of 

15 homologous genes corresponding to human counterparts. Distinct mouse 
genes that encoded proteins with highly homologous sequences were 
designated as MS4a4B, MS4a4C, MS4a4D, and as MS4a6B, MS4a6C, and 
MS4a6D to signify close homology. 

II. C. MS4A Gene Chromosome Locations 

20 Chromosome locations for the human MS4A4A, MS4A6A, MS4A7, 

and MS4A8B genes were identified in two distinct homology searches. 
Regions of human MS4A4A (bp 1286-1588), MS4A6A (bp 682-1106), 
MS4A7 (bp 502-941), MS4A7 (bp 1015-1177), and MS4A8B (bp 1007- 
1350), were 98%, 98%, 97%, 99% and 97% identical with human STS 

25 genomic sequence tag sites, WI-11578, SHGC-36634, WI-12101, WIAF- 
3856, and WI-14145, respectively ( http://www.ncbi.nlm.nih.gov/blast) . 
These genomic sequence tag sites are located on human chromosome 1 1 at 
Genomic Database locus D11S1357-D11S913, which maps to 11q12-13 
( http://www.ncbi.nlm.nih.gov/genemap) . These mapping results were 

30 confirmed using the UniGene collection at the National Center for 
Biotechnology I nformation ( http://www.ncbi.nlm.nih.gov/Genemap98/) for 
expressed sequence tags identical to human MS4A4A, MS4A6A, MS4A7, 
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MS4A8B sequences. By this analysis, at least 7 of the 9 currently identified 
human MS4A genes are clustered. 

The organization of the 12 MS4A genes on human chromosome 11 
was determined by identifying sequenced human genomic DNA fragments 
5 (contigs of different lengths) from 15 BAC clones (Table 1). Contiguous 
DNA segments for each BAC were constructed based on human MS4A 
exon and cDNA sequences, and overlapping contigs. Although some gaps 
were present in MS4A gene introns (Figure 2) or between MS4A genes, the 
relative position of each gene on chromosome 11q12-13.1 was determined 

10 (Figure 6). MS4A1 was located in a telemetric region of 1 1q1 2-1 3.1 
compared with MS4A2 and MS4A3. Seven MS4A genes were located in 
between MS4A1 and MS4A2. Two other MS4A genes, MS4A8B and 
MS4A10 were centromeric to MS4A2 and MS4A3, although the distance 
between these genes was not determined. Interestingly, MS4A6A, 

15 MS4A4E, MS4A4A and MS4A6E were arranged linearly suggesting that 
these genes might have arisen through the duplication of a single genomic 
element. It is envisioned that this genetic locus extends further and contains 
additional MS4A genes. 

[LA. MS4AGene Structure 

20 Complete coding region sequences were verified for each deduced 

protein, except for the MS4a3 cDNA that was not full-length (Figure 1). 
Proposed ATG translation initiation codons were based on the translation 
initiation consensus sequence, ANN ATG (Kozak (1986) Cell 44:283-292), 
and the existence of in-frame upstream translation stop codons in most 

25 cases. Whether the first or second ATG codon in mouse MS4a8B was used 
for translation initiation was unknown although the second ATG was identical 
with the start codon of human MS4A8B (Figure 7). 

Poly(A) attachment signal sequences were identified in the proximal 
3' untranslated regions of each gene product except MS4A6A, MS4A6E, 

30 MS4A10, and MS4a6C. Two poly(A) signal sequences were found in 
MS4a4D, MS4A5, and MS4a10 transcripts, while four were observed in 
MS4A4A transcripts. 
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The disclosed MS4A cDNAs were further used to annotate the 
genomic sequence derived from BAC clones. Annotated features include 
definition of coding regions, intronjexon junctions, sequences upstream of 
the initial coding region of each gene that comprise the promoter region, and 
5 other adjacent sequences that could also comprise gene regulatory 
elements. Representative methods for further characterizing a MS4A 
promoter region are disclosed in Example 9. 

Annotation of human MS4A genomic regions (SEQ ID NOs:73-81), as 
disclosed herein, enabled a comparison of gene structure among MS4A 

10 genes. The overall domain organization of each MS4A gene was similar 
(Figures 2 and 7). All exon|intron|exon boundaries were consistent with 
consensus splice-donor and -acceptor sequences unless otherwise 
indicated, with exon|GTGAGT-intron-CAG|exon sequences in most cases 
(Aebi & Weissmann, 1987). In addition, the splice junctions for all translated 

15 exons were located after the third nucleotide in each codon. Most MS4A 
proteins were encoded by 6 exons except MS4A2, MS4A5, and MS4A6E 
(Figure 2 and 7). In these exceptions: the N-terminal cytoplasmic domain of 
MS4A2 was encoded by two exons (Kuster et al., 1992); the MS4A5 and 
MS4A6E genes did not encode C-terminal cytoplasmic domains; and the 

20 MS4A6E gene had only two membrane spanning domains. Intron lengths 
demonstrated wide variation from 181 bp in MS4A12 to 13,731 bp in 
MS4A5. In some cases however, exact intron lengths were not determined; 
MS4A3, MS4A4, and MS4A12 (Figure 2). Distances between translation 
initiation and termination codons were determined for most MS4A genes; 

25 with MS4A6E being the smallest (5,060 bp) and MS4A4E being the longest 
(23,379 bp) genes (Figure 6). Thus, the intron|exon organization of all 
MS4A family members is consistent with the high degree of conservation 
within this gene family. 

There were no amino-terminal signal sequences, although all MS4A 

30 proteins contained hydrophobic regions of sufficient length to pass through 
the membrane at least four times. Notable was a marked clustering of 
charged residues at both ends of the putative transmembrane domains, 
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some of which were highly conserved. In some cases, the first and second 
putative transmembrane domains of MS4A proteins were a continuous 
stretch of hydrophobic amino acids without an obvious inter-transmembrane 
hydrophilic bridge. By contrast, MS4A4A and MS4A7 had 6 to 7 hydrophilic 
5 amino acids inserted between the first and second hydrophobic domains. In 
human MS4A4A and mouse MS4a4B, MS4a4C, and MS4a4D, an extensive 
hydrophobic region followed the fourth putative membrane-spanning 
domain. Thus, the overall structure of MS4A family members was well 
conserved. 

10 ILE. MS4A Gene Splice Variants 

Among the MS4A cDNAs sequenced and EST sequences analyzed, 
multiple splice variants were identified that encoded variant MS4A proteins. 
In most cases, exons were spliced out, which generated truncated protein 
products. Potential splice variants of the MS4A4A, MS4A5, MS4A6A, and 

15 MS4A7 genes were identified. Whether these alternatively spliced variants 
produce functional proteins has yet to be determined. 

Two splice variations of the MS4A4A gene were identified during an 
analysis of MS4A4A mRNA expression by lymphoblastoid cell lines. Most of 
the hematopoietic cell lines examined expressed transcripts encoding a full- 

20 length MS4A4A protein as shown in Figure 7. However, a second smaller 
transcript was also expressed in most cases that contained a potential exon 
deletion of 158 nucleotides. This was a frequent event since 40% of 
MS4A4A cDNAs generated from the BJAB B cell line encoded the truncated 
protein. In addition, the same splicing event was observed in two of five 

25 EST sequences that covered this region of the MS4A4A protein. Splicing- 
out this potential exon deleted the third membrane-spanning domain and the 
second extracellular loop from the full-length protein (positions 110-163, 
Figure 3). Of interest, this splicing event fused the first/second membrane 
spanning domains with the fourth membrane spanning domain. However, 

30 the fourth transmembrane spanning domain in MS4A4A is followed by 
another hydrophobic region of sufficient length to traverse the membrane 
(disclosed herein). This suggests that differential splicing can generate an 
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alternative MS4A4A protein with four membrane spanning domains lacking a 
significant extracellular domain. 

In the case of the MS4A5 gene, two of nine MS4A5 EST sequences 
analyzed (GenBank Accession Nos. AA411806 and AA781801) encoded a 
5 splice variant that preserved the reading frame of the transcript. In both 
sequences, the exon encoding the third membrane-spanning domain and 
the second extracellular loop from the full-length protein (TM3, Figure 1) was 
spliced out using normal splice-donor and -acceptor sequences, which 
deleted 51 amino acids (114-164) from the full length protein (Figure 7). 

10 This deletion resulted in a protein with the first/second membrane spanning 
domains fused with the fourth predicted membrane-spanning domain. Thus, 
the truncated MS4A5 protein would possess three membrane-spanning 
domains with an extracellular carboxyl-terminal domain. 

A novel splicing event was observed in the MS4A6A gene which 

15 resulted in a truncated protein. A novel splice donor site (CAG T 683 |GT GAG 
T) is located within the exon encoding the TM3/extracellular loop domains 
(Figure 4). This cryptic splice donor site was spliced with the normal 3' 
splice acceptor site of the exon encoding the TM4 domain, which thereby 
deletes nucleotides 684-787 from MS4A6A transcripts (Figure 4). Since 

20 there was an extra T introduced into the codon sequence due to this 
alternative splicing event, there was a frameshift in the coding sequence. 
This potentially results in the attachment of a novel 30 amino acid sequence 
(-WNSLSDADLHSAGILPSCAHCCAAVETGLL) that is not predicted to be 
hydrophobic. Thus, the variant MS4A protein would be 70 amino acids 

25 shorter and would tack the fourth membrane-spanning and cytoplasmic 
domains. This alternative splicing event was found in 3 of 29 EST 
sequences that encoded this region (GenBank Accession Nos. AI278475, 
AA461046, and AA448335) and in one cDNA clone (GenBank Accession 
No. AB013104). 

30 Splice variation in MS4A7A transcripts produces two distinct protein 

products in addition to the presumably normal protein. In one case, a splice 
variation in MS4A7A transcripts produces a protein product similar in 
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structure to the MS4A6E protein. The exon encoding the first/second 
membrane spanning domains (amino acids 50-94, Figure 7) was deleted in 
2 of 4 MS4A7 EST sequences analyzed (GenBank Accession Nos. N42191 
and R11179) that cover this region. Thus, the protein product would have a 
5 longer N-terminal cytoplasmic domain and only two membrane spanning 
domains. In the second case, the exon encoding the fourth membrane- 
spanning domain (amino acids 183-216) was deleted in 2 EST sequences 
(GenBank Accession Nos. R11180 and AM 88478) out of 18 sequences 
analyzed (Figure 7). 

10 j_LF. MS4A Gene Polymorphisms 

Putative polymorphisms were identified in the MS4A6A gene. Two 
nucleotide substitutions were found in cDNA clone ATCC No. 499181 and in 
13 of 38 EST sequences analyzed (Figure 1). The first substitution was at 
nucleotide 373 that exchanged a C for a T, which did not alter the amino 

15 acid sequence. The second substitution resulted in a Ser in place of Thr at 
amino acid 185. In addition, a third substitution was found in 4 of the 38 
EST sequences analyzed where a Ser was substituted in place of an Ala at 
amino acid position 183. This substitution was paired with a Ser to Thr 
substitution at amino acid position 185 in half of the clones analyzed. These 

20 differences most likely represent common sequence polymorphisms since 
they were observed in multiple independent cDNA clones. Based on our 
genetic DNA analysis, it is unlikely that these differences could represent 
transcripts from distinct genes that are almost identical in coding sequence. 
As with the MS4A6A gene (disclosed herein), potential gene 

25 polymorphisms were observed in MS4A6E. Three cDNA clones 
representing partial transcripts were sequenced completely on both strands. 
The predicted MS4A6E gene product and one cDNA clone (ATCC No. 
3704466) had identical sequences. However, the ATCC No. 3557769 cDNA 
had a nucleotide substitution at position 314 (Figure 4) that exchanged a T 

30 for a C, which did not alter the predicted amino acid sequence. The ATCC 
No. 1852248 cDNA clone had the longest insert that starts at nucleotide 
position 60 and ended at position 661 as shown in Figure 4. This cDNA had 
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a substitution at nucleotide 153 that exchanged a G for a T, which resulted in 
a Phe in place of Val at amino acid 47 (Figure 4). Therefore, sequence 
polymorphisms can exist within the MS4A6E gene. 

Other potential polymorphisms were observed in other MS4A family 
5 members based on consistent nucleotide variations found in MS4A4E 
sequences. 

The assembly and annotation of genomic sequences comprising 
MS4A genes in the region of human chromosome 11q12-13.1, disclosed 
herein for the first time, provide source material for identification of 

10 polymorphisms that are linked to MS4A genes. Such polymorphisms can 
include single nucleotide polymorphisms as disclosed within the MS4A6A 
and MS4A6E coding region sequences. In addition, polymorphisms within 
or genetically linked to MS4A genes can also comprise restriction length 
polymorphisms (RFLPs) (Lander & Botstein (1989) Genetics 121:185-199), 

15 short tandem repeat polymorphisms (STRPs), short sequence length 
polymorphisms (SSLPs) (Dietrich et al. (1996) Nature 380:149-152), 
amplified fragment length polymorphisms (AFLPs) (Latorra et al. (1994) PCR 
Methods Appl 3(6):351-358), and microsatellite markers (Schalkwyk et al. 
(1999) Genome Res 9:878-887). Identification of polymorphisms within an 

20 isolated DNA molecule are known to one of skill in the art. 
II. G. MS4A Proteins 

The MS4A genes encoded proteins of 16-29 kDa (Table 2). 
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Table 2 
MS4A Family Members 



Human 


Mouse 


Human/Mouse 


Name 


kDa 


Name 


kDa 


Homology 






MS4a3 




63% (partial) 


MS4A4A 


23 












Ms4a4B 


24 


41% 






Ms4a4C 


24 


44% 






Ms4a4D 


24 


40% 


MS4A4E 


24 








MS4A5 


22 








MS4A6A 


27 












Ms4a6B 


27 


52% 






Ms4a6C 


24 


51% 






Ms4a6D 


26 


53% 


MS4A6E 


16 








MS4A7 


26 


MS4a7 


26 


53% 


MS4A8B 


26 


MS4a8B 


29 


63% 


MS4A10 


27 


MS4a10 


29 


52% 


MS4A12 


26 


MS4a12( P ig) 


26 


60% 



Predicted molecular weights for the new MS4A family members and the 
5 percentage amino acid sequence identity between deduced MS4A and 
MS4a proteins. 



WO 02/062946 



53 



PCT/US01/48437 



Comparisons between CD20 and the predicted amino acid 
sequences for human MS4A4A, MS4A5, MS4A6A, MS4A7, MS4A8B, and 
MS4A12 revealed 23-29% amino acid sequence identity (Figure 7). The 
highest degree of identity was found in the first three transmembrane 
5 domains with multiple regions of conserved amino acids. In particular, the 
amino acid sequences LGAXQI (SEQ ID NO:57) and LSLG (SEQ ID NO:58) 
were common within the first transmembrane domain, GYPFWG (SEQ ID 
NO:60) and FIISGSLS (SEQ ID NO:61) were common in the second 
domain, and SLX2NX2SX3AX2G (SEQ ID NO:62) was found in the third 

10 transmembrane domain, the first and second transmembrane domains of 
MS4A8B were 46% identical in amino acid sequence with human CD20, 
41% identical with FcsRIp, and 39% identical with HTm4. The MS4A4A, 
MS4A5, MS4A6A, and MS4A7 proteins were most homologous in their first 
and second transmembrane domains with the human FcsRip chain, with 37- 

15 46% amino acid sequence identity. There was large variation between 
MS4A proteins in the N- and C-terminal cytoplasmic domains. However, 
Pro residues were significantly over-represented within the N- and C- 
terminal cytoplasmic domains of most MS4A family members. There was 
some sequence identity in the first potential extracellular loop that was -13 

20 amino acids in length for each protein. By contrast, the second predicted 
extracellular loop ranged from 10-46 amino acids in length with diverse 
sequences. 

The putative MS4A4E gene encodes a 220 amino acid protein of 23.8 
kDa with a predicted amino acid sequence that is 76% identical with the 

25 MS4A4A protein (Figure 3). Consistent with other MS4A proteins, the most 
significant homologies between MS4A4E and other MS4A family members 
were found in the membrane spanning domains (Figure 7). Common amino 
acid motifs were readily visualized such as KXLGAIQI (SEQ ID NO:57), 
GYPXWG (SEQ ID NO:60), and SGXLSI (SEQ ID NO:59) in the first and 

30 second hydrophobic regions that represent potential transmembrane 
regions. The intracellular N- and C-terminal domains were highly conserved 
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between MS4A4E and MS4A4A, but were divergent from other family 
members. 

The putative MS4A6E gene encodes a 147 amino acid protein of 15.9 
kDa with a predicted amino acid sequence that is 78% identical with the 
5 MS4A6A protein (Figure 4). The most significant homologies between 
MS4A6E and other MS4A family members were found in the membrane 
spanning domains, although MS4A6E only had two (TM3 and TM4) 
membrane-spanning domains (Figures 4 and 7). The putative second 
extracellular loops of MS4A6E and MS4A6A were of identical length (Figure 

10 4). Common amino acid motifs were readily visualized in the hydrophobic 
regions that represent potential transmembrane regions. The intracellular N- 
terminal domain was highly conserved between MS4A6E and MS4A6A, but 
were divergent from other family members. MS4A6E protein also lacks a C- 
terminal cytoplasmic domain (Figure 4). 

15 The putative MS4A10 gene encodes a translated 241 amino acid 

protein of 26.9 kDa with a predicted amino acid sequence that is 52% 
identical with the mouse MS4a10 protein (Figure 5). The most significant 
homologies between MS4A10 and MS4a10 were found in the membrane 
spanning domains and the putative second extracellular loop (Figure 5), 

20 Although the N-terminal cytoplasmic domains of MS4A10 and MS4a10 were 
of similar length, the intracellular N- and C-terminal domains had the lowest 
sequence homologies among domains. The cytoplasmic C-terminal domain 
was 28 amino acids shorter in MS4A10 than MS4a10. Nonetheless, based 
on the sequence similarities of translated regions, it appears that MS4A10 

25 and MS4a10 represent homologous genes that are more similar to one 
another than other MS4A family members. 

Ten novel mouse MS4A proteins were identified that shared 40-63% 
amino acid sequence identity with their potential human counterparts (Figure 
7, Table 2). For comparison, the mouse and human CD20 proteins are 74% 

30 identical in amino acid sequence (Tedder et al., 1988a). A single partial 
cDNA was identified that encoded the mouse homologue for HTm4 (MS4a3, 
Figure 7). The predicted amino terminus of the proposed MS4a3 protein 
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was 23 amino acids shorter than in the human protein, although their 
overlapping regions were 63% identical in amino acid sequence. In all 
cases, the transmembrane domains of the human and mouse MS4A 
proteins were the most well conserved regions. For example, the human 
5 MS4A8B protein was 78% identical in sequence to MS4a8B in the first 3 
transmembrane domains and 68% identical in domain 4. Additional MS4A 
genes are likely to be identified in humans and mice, including the mouse 
MS4A5 homologue. 

A UPGMA (unweighted pair group method using arithmetic averages) 
10 tree showing relatedness of deduced MS4A and MS4a protein sequences is 
depicted in Figure 8. 

ML Methods for Detecting a MS4A Nucleic Acid Molecule 

In another aspect of the invention, a method is provided for detecting 

15 a nucleic acid molecule that encodes a MS4A polypeptide. According to the 
method, a biological sample having nucleic acid material is procured and 
hybridized under stringent hybridization conditions to a MS4A nucleic acid 
molecule of the present invention. Such hybridization enables a nucleic acid 
molecule of the biological sample and the MS4A nucleic acid molecule to 

20 form a detectable duplex structure. Preferably, the MS4A nucleic acid 
molecule includes some or all nucleotides of any one of the odd-numbered 
SEQ ID NOs:1-37. Also preferably, the biological sample comprises human 
nucleic acid material. 

III.A. Expression of MS4A Family Members in Hematopoietic Cells 

25 Since CD20, FceRIB, and HTm4 expression are restricted to 

hematopoietic tissues, MS4A gene transcription was assessed by PCR 
amplification of cDNA from eleven human hematopoietic cell lines. Like 
CD20, MS4A8B was only expressed by B cell lines (Table 3). MS4A5 was 
only expressed by a promonocytic cell line. MS4A6A transcripts were 

30 expressed by B cell, myelomonocytic, and erythroleukemia cell lines. 
MS4A4A mRNA was expressed by all cell lines examined, although the 
relative mRNA levels varied significantly. MS4A7 was expressed in most, 
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but not all of the cell lines tested. MS4A12 transcripts were not detected in 
these cell lines. Thus, most MS4A family members are likely to be 
expressed in hematopoietic tissues. 

ESTs encoding MS4A transcripts were isolated from a variety of 
5 different cDNA libraries. MS4A4A ESTs were from aorta, brain, breast, 
heart, kidney, lung, ovary, pancreas, placenta, prostate, stomach, testis, and 
uterine tissues. MS4A5 ESTs were only isolated from testis. MS4A6A ESTs 
were from aorta, brain, the central nervous system, colon, gall bladder, 
heart, kidney, lung, muscle, ovary, pancreas, placenta, prostate, skin, 

10 stomach, tonsil, uterus and embryonic tissues. MS4A7 ESTs were from 
lung, kidney, lymphocytes, mammary gland, placenta, spleen, testis, thymus, 
and uterine tissues. MS4A8B ESTs were from brain, lung, uterus and 
embryonic tissues. A single MS4A12 EST was isolated from colon. This 
demonstrates differential MS4A gene transcription among lymphoid and 

15 non-lymphoid tissues. 
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Since most of the MS4A genes are expressed by hematopoietic cells, 
MS4A4E, MS4A6E and MS4A10 transcription were assessed by RT-PCR 
amplification of cDNA from human hematopoietic cell lines and human 
tissues. Transcripts from eleven human hematopoietic cell lines were 
5 evaluated; one pre-B cell line (NALM-6), three B cell lines (BJAB, DAUDI, 
and SB), four T cell lines (HSB-2, HUT-78, JURKAT, and MOLT15), two 
myelomonocytic lines (HL60 and U937), and one erythroleukemia cell line 
(K562). In addition, transcripts from eight human tissues were evaluated; 
colon, ovary, peripheral blood leukocytes, prostate, small intestine, spleen, 

10 testes and thymus. However, MS4A4E, MS4A6E and MS4A10 transcripts 
were not detected in any of these cell lines or tissues. 

MS4A4E, MS4A6E, and MS4A10 sequences were also used to 
search the translated GenBank databases using the BLAST program 
(Altschul et al., 1997). Eleven EST sequences representing MS4A6E 

15 transcripts were found that represented nine cDNAs isolated from pooled 
fetal organ libraries (GenBank Accession Nos. AA382998, AA909515, 
AA917066, AI222355, AI279944, AI684553, AI699419, AI743473, 
AI806247), one cDNA from a pooled germ cell tumor library (GenBank 
Accession No. AI968835), and one cDNA from a colon tumor (GenBank 

20 Accession No. AW951636). EST cDNAs encoding MS4A4E or MS4A10 
sequences were not identified. This suggests that MS4A4E, MS4A6E, and 
MS4A10 transcripts are rare among normal tissues or they are primarily 
expressed during oncogenesis or embryogenesis. 

MS4a gene expression by mouse tissues was assessed by Northern 

25 analysis and PCR amplification of cDNAs (Table 4). In most cases 
assessed, Northern analysis failed to detect specific MS4a transcripts in 
tissues that revealed transcript production by PCR amplification. These 
results suggest that MS4a transcripts are only produced by subpopulations 
of cells within each tissue such that transcript levels were often below the 

30 level of detection by Northern analysis. Nonetheless, MS4a4B, MS4a4C, 
and MS4a6B transcripts were found at high levels in thymus, spleen and 
peripheral lymph nodes, with less abundant levels in non-lymphoid tissues. 
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MS4a6C was only expressed by thymus, spleen, PLN and bone marrow. 
MS4a4C, MS4a6D and MS4a7 were expressed in all tissues examined. 
MS4a8B transcripts were expressed by spleen, peripheral lymph nodes, 
colon, liver, heart, lung and bone marrow. MS4a 10 transcripts were found in 
5 thymus, kidney, colon, brain, and testis. In addition, CD20 (MS4a1), FceRip 
(MS4a2), and MS4a3 expression were primarily restricted to hematopoietic 
tissues. MS4a3, MS4a4B, MS4a4C, MS4a6B, MS4a6C, MS4a6D, MS4a7, 
MS4a8B, and MS4a10 were also expressed by various hematopoietic and 
lymphoblastoid cell lines. Therefore, most MS4a family members were 
10 expressed by hematopoietic cells. 

Table 4 

MS4a Gene Expression by Mouse Tissues 8 
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a Gene transcription was assessed by PCR amplification of cDNA generated 
from mRNA isolated from tissue samples. Values represent the level of 
15 PCR product generated relative to the glyceraldehyde-3-phosphate 
dehydrogenase (G3PDH) control as described for Table 3. Peripheral lymph 
node (PLN) and bone marrow (BM). 
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Expression of MS4A family members was also assessed in mouse 
hematopoietic cell lines (Table 5). Nine of the twelve MS4A genes were 
expressed in pre-B cell lines and five of the MS4A genes were expressed in 
B cell lines. Six of the MS4A genes were expressed by T cell lines. These 
5 data suggest that B cells can express most members of the MS4A gene 
family, although the patterns of expression of each gene is distinct. 

Table 5 

MS4a Expression by Mouse Lymphoid Tissues and Cell Lines 3 
Tissues Pre B cell lines B cell lines T cell lines 
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10 



a Gene transcription was assessed by PCR amplification of cDNA generated 
from mRNA isolated from each cell type. Values represent the level of PCR 
product generated relative to the glyceraldehyde-3-phosphate 
dehydrogenase (G3PDH) control in three separate PCR reactions: no 
15 specific PCR product detected; +, low levels of the appropriate band were 
detectable; ++ to +++, appropriate bands of increasing intensity were readily 
visualized in all samples examined. Identical results were obtained using 
two different primer pairs for cDNA amplification. 
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MLB. Detection of MS4A Polymorphisms 

In another embodiment, genetic assays based on nucleic acid 
molecules of the present invention can be used to screen for genetic 
variants by a number of PCR-based techniques, including single-strand 
5 conformation polymorphism (SSCP) analysis (Orita, M., et al. (1989) Proc 
Natl Acad Sci USA 86(8):2766-2770), SSCP/heteroduplex analysis, enzyme 
mismatch cleavage, and direct sequence analysis of amplified exons (Kestila 
et al. (1998) Mol Cell 1(4):575-582; Yuan et al. (1999) Hum Mutat 14(5):440- 
446). Automated methods can also be applied to large-scale 

10 characterization of single nucleotide polymorphisms (Brookes (1999) Gene 
234(2):177-186; Wang et al. (1998) Science 280(5366): 1077-82). The 
present invention further provides assays to detect a mutation of a variant 
MS4A locus by methods such as allele-specific hybridization (Stoneking et 
al. (1991) >4a77 J Hum Genet 48(2):370-82), or restriction analysis of amplified 

15 genomic DNA containing the specific mutation. 

IV. Recombinant Production of a MS4A Polypeptide 

The present invention also provides a method for recombinant 
production of a MS4A polypeptide, as described in Example 3. Preferably, 
20 the recombinant polypeptide comprises some or all of the amino acid 
sequences of any one of the even-numbered SEQ ID NOs:2-38. 

Recombinantly produced proteins are useful for a variety of purposes, 
including structural determination of a MS4A polypeptide, generation of an 
antibody that recognizes a MS4A polypeptide, and screening assays to 
25 identify a chemical compound or peptide that interacts with a MS4A 
polypeptide, described further herein below. 

V. Production of MS4A Antibodies 

In another aspect, the present invention provides a method of 
30 producing an antibody immunoreactive with a MS4A polypeptide, the 
method comprising recombinantly or synthetically producing a MS4A 
polypeptide, or portion thereof, to be used as an antigen. The MS4A 
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polypeptide is formulated so that it is can be used as an effective 
immunogen. An animal is immunized with the formulated MS4A 
polypeptide, generating an immune response in the animal. The immune 
response is characterized by the production of antibodies that can be 
5 collected from the blood serum of the animal. Optionally, cells producing a 
MS4A antibody can be fused with myeloma cells, whereby a monoclonal 
antibody can be selected. Exemplary methods for producing a monoclonal 
antibody that recognizes a MS4A protein are described in Example 4. 
Preferred embodiments of the method use a polypeptide set forth as any 

1 0 one of the even-numbered SEQ ID NOs:2-38. 

The present invention also encompasses antibodies and cell lines 
that produce monoclonal antibodies as described herein. 

The foregoing antibodies can be used in methods known in the art 
relating to the localization and activity of the MS4A polypeptide sequences of 

15 the invention, e.g., for cloning of MS4A nucleic acids, immunopurification of 
MS4A polypeptides, imaging MS4A polypeptides in a biological sample, 
measuring levels thereof in appropriate biological samples, and in diagnostic 
methods. 

20 VL Methods for Detecting a MS4A Polypeptide 

In another aspect of the invention, a method is provided for detecting 
a level of MS4A polypeptide using an antibody that specifically recognizes a 
MS4A polypeptide, or portion thereof. In a preferred embodiment, biological 
samples from an experimental subject and a control subject are obtained, 

25 and MS4A polypeptide is detected in each sample by immunochemical 
reaction with the MS4A antibody. More preferably, the antibody recognizes 
amino acids of any one of the even-numbered SEQ ID NOs:2-38, and is 
prepared according to a method of the present invention for producing such 
an antibody. 

30 In one embodiment, a MS4A antibody is used to screen a biological 

sample for the presence of a MS4A polypeptide. A biological sample to be 
screened can be a biological fluid such as extracellular or intracellular fluid, 
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or a cell or tissue extract or homogenate. A biological sample can also be 
an isolated cell (e.g., in culture) or a collection of cells such as in a tissue 
sample or histology sample. A tissue sample can be suspended in a liquid 
medium or fixed onto a solid support such as a microscope slide. In 
5 accordance with a screening assay method, a biological sample is exposed 
to an antibody immunoreactive with a MS4A polypeptide whose presence is 
being assayed, and the formation of antibody-polypeptide complexes is 
detected. Techniques for detecting such antibody-antigen conjugates or 
complexes are well known in the art and include but are not limited to 
10 centrifugation, affinity chromatography and the like, and binding of a labeled 
secondary antibody to the antibody-candidate receptor complex. 

In one embodiment, an antibody that specifically recognizes a MS4A 
polypeptide can be used to assess the tissue- or cell-distribution of MS4A 
protein, for example, to evaluate CD20 expression during B lymphocyte 

15 development (Figure 9). CD20 expression in B220 + lymphocytes from 
lymphoid tissues of wild type mice was examined by two-color 
immunofluorescence. In bone marrow, three types of B220 + cells were 
detected. The vast majority of B220 hi lymphocytes expressed CD20. 
However, the majority of B220 l0 lymphocytes were CD20-negative. Thus, 

20 CD20 was predominantly expressed by mature B cells. 

CD19 expression is restricted to normal and neoplastic B cells and 
follicular dendritic cells. CD19 is expressed early by B progenitor cells in the 
bone marrow, presumably at the late pro-B or early pre-B cell stages around 
the time of immunoglobulin heavy chain rearrangement (Anderson et al. 

25 (1984) Blood 63:1424). Expression persists during all stages of B cell 
maturation and is lost upon terminal differentiation to plasma cells. 

Double staining of CD20 with IgM and CD19 antibodies showed that 
some of the CD19 10 and IgM 10 cells were CD20 negative in the bone marrow. 
A few IgM- cells also expressed low levels of CD20 in the bone marrow. 

30 This data suggested that the CD20 expression was later than the CD19 
expression but before or around the time of IgM expression during B cell 
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development in the bone marrow since these cells were gated on 
lymphocytes not dendritic cells. 

The level of CD20 expression observed on mature B220 hi B cells in 
bone marrow was maintained by B cells from peripheral lymphoid tissues. 
5 The vast majority of B220 + B cells in the spleen, blood, peripheral lymph 
nodes, and peritoneal cavity expressed CD20. Therefore, like human CD20, 
mouse CD20 was also exclusively expressed on B cells from the immature B 
cell stage to mature B cells. 

10 V1L Identification of MS4A Modulators 

VILA. Screening for Small Molecule Liaands that Interact with a 

MS4A Polypeptide 
The present invention further discloses a method for identifying a 
compound that modulates MS4A function. According to the method, a 
15 MS4A polypeptide is exposed to a plurality of compounds, and binding of a 
compound to the isolated MS4A polypeptide is assayed. A compound is 
selected that demonstrates specific binding to the isolated MS4A 
polypeptide. Preferably, the MS4A polypeptide used in the binding assay of 
the method includes some or all amino acids of any one of the even- 
20 numbered SEQ ID NOs:2-38. 

Several techniques can be used to detect interactions between a 
protein and a chemical ligand without employing an in vivo ligand. 
Representative methods include, but are not limited to, Fluorescence 
Correlation Spectroscopy, Surface-Enhanced Laser Desorption/lonization 
25 Time-Of-flight Spectroscopy, and Biacore technology, as described in 
Example 5. These methods are amenable to automated, high-throughput 
screening. 

Candidate regulators include but are not limited to proteins, peptides, 
and chemical compounds. Structural analysis of these selectants can 
30 provide information about ligand-target molecule interactions that enable the 
development of pharmaceuticals based on these lead structures. 
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Similarly, the knowledge of the structure a native MS4A polypeptide 
provides an approach for rational drug design. The structure of a MS4A 
polypeptide can be determined by X-ray crystallography or by computational 
algorithms that generate three-dimensional representations. See Huang et 
5 al. (2000) Pac Symp Biocomput 230-41; Saqi et al. (1999) Bioinformatics 
15:521-522. Computer models can further predict binding of a protein 
structure to various substrate molecules, that can be synthesized and tested. 
Additional drug design techniques are described in U.S. Patent Nos. 
5,834,228 and 5,872,011. 

10 VII. B. Methods for Identifying Modulators of MS4A Gene Expression 

The assembly and annotation of genomic sequences comprising 
MS4A genes in the region of human chromosome 1 1 q1 2-13.1 , disclosed 
herein for the first time, identify MS4A gene regulatory regions. Preferably, 
MS4A gene regulatory regions comprise sequences upstream of the initial 

15 coding region of each MS4A gene as disclosed in SEQ ID NOs:73-81. An 
expression cassette comprising a MS4A promoter region can be employed 
in assays for the identification of modulators of MS4A expression. Thus the 
present invention also provides a method for identifying a substance that 
regulates MS4A gene expression using a chimeric gene that includes an 

20 isolated MS4A gene promoter region operably linked to a reporter gene. 
According to this method, a gene expression system is established that 
includes the chimeric gene and components required for gene transcription 
and translation so that reporter gene expression is assayable. To select a 
substance that regulates MS4A gene expression, the method further 

25 provides the steps of using the gene expression system to determine a 
baseline level of reporter gene expression in the absence of a candidate 
regulator; providing one or more candidate regulators to the gene expression 
system; and assaying a level of reporter gene expression in the presence of 
a candidate regulator. A candidate regulator is selected whose presence 

30 results in an altered level of reporter gene expression when compared to the 
baseline level. 

Several molecular cloning strategies can be used to identify 
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substances that specifically bind a MS4A gene cis-regulatory element. In 
one embodiment, a cDNA library in an expression vector, such as the 
Iambda-gt1 1 vector, can be screened for cDNA clones that encode a MS4A 
gene regulatory element DNA-binding activity by probing the library with a 
5 labeled MS4A DNA fragment, or synthetic oligonucleotide (Singh et al. 
(1989) Biotechniques 7:252-261). Preferably, the nucleotide sequence 
selected as a probe has already been demonstrated as a protein binding site 
using a protein-DNA binding assay, as described in Example 9. 

In another embodiment, transcriptional regulatory proteins are 

10 identified using the yeast one-hybrid system (Luo et al. (1996) Biotechniques 
20(4):564-568; Vidal et al. (1996) Proc Natl Acad Sci USA 93(19):10315- 
10320; Li & Herskowitz (1993) Science 262:1870-1874). In this case, a cis- 
regulatory element of a MS4A gene is operably fused as an upstream 
activating sequence (UAS) to one, or typically more, yeast reporter genes 

15 such as the lacZ gene, the URA3 gene, the LEU2 gene, the HIS3 gene, or 
the LYS2 gene, and the reporter gene fusion construct(s) is inserted into an 
appropriate yeast host strain. It is expected that the reporter genes are not 
transcriptionally active in the engineered yeast host strain, for lack of a 
transcriptional activator protein to bind the UAS derived from the MS4A gene 

20 promoter region. The engineered yeast host strain is transformed with a 
library of cDNAs inserted in a yeast activation domain fusion protein 
expression vector, e.g. pGAD, where the coding regions of the cDNA inserts 
are fused to a functional yeast activation domain coding segment, such as 
those derived from the GAL4 or VP16 activators. Transformed yeast cells 

25 that acquire a cDNA encoding a protein that binds a cis-regulatory element 
of a MS4A gene can be identified based on the concerted activation the 
reporter genes, either by genetic selection for prototrophy (e.g. LEU2, HIS3, 
or LYS2 reporters) or by screening with chromogenic substrates (e.g., a lacZ 
reporter) by methods known in the art. 

30 The present invention also provides an in vivo assay for discovery of 

modulators of MS4A gene expression. In this case, a transgenic non-human 
animal is made such that a transgene comprising a MS4A gene promoter 
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and a reporter gene is expressed and a level of reporter gene expression is 
assayable. Such transgenic animals can be used for the identification of 
compounds that are effective in modulating MS4A gene expression. 
In vitro or in vivo screening approaches can also survey more than one 
5 modulatable transcriptional regulatory sequence simultaneously. 

VIM, Animal Models 

The present invention further pertains to an animal model of disorders 
associated with a MS4A nucleic acid or polypeptide, including but not limited 

10 to atopic disorders, abnormal target cell development, function, and Ca ++ 
responses. Such a model can be prepared by several methods. Using a 
transgenic approach, knock-out, knock-in, or knock-down mutation of the 
MS4A gene can suppress MS4A function. The present invention also 
teaches that an animal model of a MS4A-related disorder can be prepared 

15 by immunizing an animal with a MS4A polypeptide. The resulting immune 
response in the animal comprises a production of antibodies that specifically 
bind a MS4A polypeptide, thereby disrupting its biological activity. A method 
is also provided for generating an animal model of a MS4A-related disorder 
by administering to an animal a compound that disrupts MS4A expression or 

20 function. Such a compound is discovered by methods disclosed herein. 
VIII.A. Generation of CD20-Deficient Mice 

CD20-deficient mice were generated by targeted disruption of the 
CD20 gene in embryonic stem (ES) cells using homologous recombination, 
as described in Example 6. A targeting vector was generated that replaces 

25 exons encoding part of the second extracellular loop, the 4 th transmembrane 
domain, and the large carboxyl-terminal cytoplasmic domain of CD20 with a 
neomycin resistant gene (Figure 10A-D). Appropriate gene targeting 
generates an aberrant CD20 protein truncated at amino acid position 157 
and fused with an 88 amino acid protein encoded by the Neo r gene promoter 

30 sequence. 

After DNA transfections, 6 of 1 15 Neo-resistant ES cell clones carried 
the targeted allele as determined by Southern blot analysis of EcoR V 
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digested genomic DNA using a 1.5 kb DNA probe (Figure 10D). Appropriate 
targeting was further verified in two clones by Southern analysis of ES cell 
DNA digested with BamH I (>12 kb fragment was reduced to a 6.5 kb band 
in targeted cells), Kpn I (7.2 kb became 5.5 kb), and Ssp I (5.6 kb became 
5 7.0 kb) using the same probe. Cells of one ES cell clone were injected into 
blastocysts that were transferred into foster mothers. Highly chimeric male 
offspring (80-100% according to coat color) bred with C57BL/6 (B6) females 
transmitted the mutation to their progeny (Figure 10E). Mice homozygous 
for disruption of the CD20 gene were obtained at the expected Mendelian 

1 0 frequency by crossing heterozygous offspring. 

Appropriate targeting of the CD20 gene was further verified by PCR 
analysis of genomic DNA from homozygous offspring (Figure 10F). Wild 
type CD20 mRNA was absent in CD20" 7 ' mice as confirmed by PCR 
amplification of cDNA generated from splenocytes of CD20" /_ mice (Figure 

15 10G). CD20-deficient mice (CD20' / ") thrived and reproduced as well as their 
wild type littermates and did not present any obvious anatomical or 
morphological abnormalities during the first year of life. 

Absence of cell surface CD20 protein expression in CD20' 7 " mice was 
further verified by staining B220 + splenocytes with murine anti-CD20 

20 monoclonal antibodies. Hybridomas producing these antibodies were 
generated using splenocytes from CD20" A mice that were immunized with 
CD20-GFP cDNA-transfected 300.19 cells. Ten hybridomas secreted 
antibodies reactive with 300.19 (Figure 10H) and CHO (Figure 101) cells 
transfected with CD20-GFP cDNA, but not with untransfected CHO or 

25 300.19 cells (Table 6). These antibodies also reacted with CD20 epitopes 
expressed on the cell surface of B220 + splenocytes from wild type mice, but 
not with splenocytes from CD20" / ' mice (Figure 10J). Therefore, targeted 
mutation of the CD20 gene abrogated cell surface CD20 protein expression. 
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Table 6 

Anti-CD20 Monoclonal Antibodies Generated in CD20' / ' Mice 1 









Whole Cell ELISA 3 


FACS Analysis' 3 




Ab Name 


Clone Name 


Isotype 


CD20-CHO CHO 
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+ 


MB20-8 


MCD20-270 


lgG2b, K 


+ 


+ 


+ 


MB20-10 


MCD20-388 


lgG2b, K 


+ 


+ 


+ 


MB20-11 


MCD20-392 


lgG2a, K 


+ 


+ 


+ 


MB20-13 


MCD20-624 


lgG3, K 


+ 


+ 


++ 


MB20-14 


MCD20-642 


IgGI.K 


+ 


+ 


++ 



a Values represent reactivity of the monoclonal antibody with adherent 
5 monolayers of CHO cells either transfected or untransfected with CD20-GFP 
cDNA as assessed by a cell-based ELISA. The monoclonal antibodies did 
not react with GFP cDNA-transfected CHO cells. 

b Cell surface reactivity of the monoclonal antibody with single cell 
suspensions of 300.19 cells either transfected or untransfected with CD20- 
10 GFP cDNA or spleen cells from wild type mice. Values represent relative 
indirect immunofluorescence staining intensity as assessed by flow 
cytometry and shown in figure 10H-J. 
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VIII. B. B Cell Development and Function in CD20" / ' Mice 

CD20~ / " mice did not show an obvious propensity for infections during 

their first year of life. They had normal frequencies of IglVI" B220'° pro/pre-B 

cells, lgM + B220 10 immature B cells and lgM + B220 hi mature B cells in the 
5 bone marrow (Figure 11, Table 7). Overall, the number of circulating and 

spleen lgM + B220 + B cells found in CD20"/" mice was increased compared 
with wild type littermates (Table 7). However, an immunohistochemical 
analysis of spleen tissue sections revealed a normal architecture and 
organization of the spleen. In the bone marrow, overall IgM expression was 
10 decreased on immature B cells, yet increased on mature B cells when 
compared with IgM levels expressed by comparable cells in wild type 

littermates. However, overall IgM expression by mature B220 hi B cells in 
the blood, spleen and lymph nodes was slightly lower in CD20^- mice 
(Figure 11B-D). There were no obvious differences in the size (light scatter 

15 properties) of CD20"/- B cells isolated from bone marrow, blood, lymph 
nodes or spleen when compared with B cells from wild type littermates. 
These data therefore suggest that CD20 plays a functional role in the 
development and tissue localization of B cells. 



20 



WO 02/062946 



72 



PCT/US01/48437 



Table 7 



Frequencies and Numbers of B Lymphocytes in CD20' / " Mice 



Tissue 



Phenotype Wild Type CD20" / " Wild Type CD20 



% of B Lymphocytes B cell numbers (x10"' 



Bone Marrow B220 lo !gM- 36 ± 2 34 ± 3 



B220'°lgM + 



19±2 



13 ±2* 



B220 hi lgM + 



14±2 



16±4 



Blood d 



B220 + lgM' 



61 ±2 



60 ±3 



3.6 ±0.5 3.9 ± ' 



Spleen 



B220 + lgM + 



51 ±6 



53 ±5 



58 ± 12 



76 ± 



Lymph Nodes 6 B220 + lgM + 26 ± 6 



19 + 2 1.2 ±0.3 0.9 ± 



Peritoneum B220 + lgM' 



70 ±4 



69 ±5 2.4 ±0.3 3.1 ± 



B220'°CD5 + 44 ± 4 



15 ±5* 



1.5 ±0.2 0.7 ±( 



B220 hi CD5" 28 ± 2 59 ± 3* 



1.0 ±0.1 2.7 ±( 



10 



a Values represent mean (± SEM) results obtained from seven 2-month-old of 
wild type controls and 10 CD20"'' mice. Numbers represent the percentage 
of lymphocytes (based on side and forward light scatter properties) 
expressing the indicated cell surface markers. 

b B cell numbers were calculated based on the total number of cells 
harvested from the indicated tissues. 

d The values indicate the number of cells/ml. 

e Values represent results from peripheral lymph nodes pairs. 

*The percentage or number was significantly different than in wild-type, p < 
0.05; **p< 0.01. 
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Within the peritoneal cavity, the number of lgM + B220 + B cells in 
CD20-/" mice was similar to that of wild-type littermates (Table 7, Figure 
1 1 E). However, there was a 4-fold decrease in the number of CD5 + B220 l0 
B1a cells, with a compensatory increase in the number of CD5" B220 ni B2 
5 cells. Therefore, CD20-deficiency predominantly affected the development 
or clonal expansion of the B1 subpopulation of B cells within the peritoneal 
cavity. Exemplary methods for quantitating B cell populations are described 
in Example 7. 

Vlll.C. Reduced fCa ++ 1i Responses in CD20 -/ ' B Cells 

10 The loss of CD20 significantly altered early B cell signaling 

responses, measured as described in Example 8. Splenic B220 + B cells 
from CD20 _/ " mice generated substantially reduced [Ca ++ ]i responses 
following surface IgM ligation when compared with wild type B cells. 
Decreased [Ca ++ ]i responses in CD20" 7 ' B cells were observed in response 

15 to both optimal (40 ug/ml, Figure 12A) and suboptimal concentrations (5 
ug/ml) of anti-IgM antibodies. Although the kinetics of [Ca ++ ]i responses in 
CD20"'" B cells was not altered, the magnitude of both the immediate [Ca ++ ]i 
increase and the sustained increase observed at later time points were 
inhibited by loss of CD20 expression. More dramatic decreases in [Ca ++ ]i 

20 responses (>50%) by CD20 _/ " B cells were observed in response to CD19 
ligation with optimal concentrations (40 ug/ml) of antibody (Figure 12A). 
Reduced [Ca ++ ]i responses following CD19 ligation on CD20"'" B cells were 
likely to result from differences in signaling capacity since Thapsigargin- 
induced (Figure 12A) and lonomycin-induced [Ca ++ ]i responses were higher 

25 in CD20" A B cells than in wild type B cells. In addition, CD19 expression 
levels were not significantly different between CD20" /_ and wild type B cells 
(Figure 12A). 

Chelation of extracellular calcium with EGTA reduced the kinetics and 
magnitude of the immediate [Ca ++ ]i increase observed following IgM 
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crosslinking (Figure 12A). However, the [Ca ++ ]i increase observed at later 
time points was not substantially inhibited by EGTA treatment. Similar 
results were observed in CD20" A B cells. By contrast, chelation of 

extracellular calcium with EGTA almost eliminated the [Ca ++ ]i response 
5 observed following CD19 crosslinking (Figure 12A). This suggests that 
transmembrane Ca ++ flux contributes substantially to the [Ca ++ ]i responses 
observed following CD19 crosslinking. That CD20-deficiency had a 
substantial effect on CD19-induced [Ca ++ ]i responses suggests that CD20 
can contribute significantly to transmembrane Ca ++ flux. 

10 The consequences of CD20 loss on transmembrane signal 

transduction was further evaluated by assessing total cellular protein 
tyrosine phosphorylation in purified B cells following IgM ligation. Although 
some variation was observed between B cells from individual mice in 
individual experiments, overall levels of tyrosine phosphorylation in resting 

15 splenic B cells were higher in CD20" 7 ' B cells than in wild type mice (Figure 
12C). In addition, protein phosphorylation in B cells from CD20" /_ mice 
increased more significantly after B cell antigen receptor (BCR) ligation than 
in wild type B cells. Thus, while CD20 expression can influence BCR- 

induced tyrosine phosphorylation, decreased [Ca ++ ]i responses in CD20" 7 " B 
20 cells are unlikely to result from significant abnormalities in transmembrane 
signaling through the BCR. 



IX. Therapeutic Applications 

Another aspect of the present invention is a therapeutic method 
25 comprising administering to a subject a substance that modulates MS4A 
biological activity. Therapeutic substances include but are not limited to 
chemical compounds, antibodies, and gene therapy vectors. Substances 
that are discovered by the methods disclosed herein are useful for 
therapeutic applications related to disorders of MS4A function. 



WO 02/062946 



75 



PCT/US01/48437 



In one embodiment, the present invention provides a method for 
disrupting MS4A function by immunizing a subject with an effective dose of 
the disclosed MS4A polypeptide. The immune system of the subject 
produces an antibody that specifically recognizes the MS4A polypeptide, 
5 and binding of the antibody to the MS4A polypeptide abolishes MS4A 
function. 

In another embodiment, the present invention provides MS4A nucleic 
acid sequences and gene therapy methods for modulating MS4A activity in a 
target cell. The gene therapy vector can encode a MS4A or sequences 
10 encoding a nucleic acid molecule, peptide, or protein that interacts with a 
MS4A protein. 

Vehicles for delivery of a gene therapy vector include but are not 
limited to a liposome, a cell, and a virus. Preferably, a cell is transformed or 
transfected with the DNA molecule or is derived from such a transformed or 

15 transfected cell. Alternatively, the vehicle is a virus, including a retroviral 
vector, adenoviral vector or vaccinia virus whose genome has been 
manipulated in alternative ways so as to render the virus non-pathogenic. 
Methods for creating such a viral mutation are detailed in U.S. Patent No. 
4,769,331. Exemplary gene therapy methods are also described in U.S. 

20 Patent Nos. 5,279,833; 5,286,634; 5,399,346; 5,646,008; 5,651,964; 
5,641,484; and 5,643,567. 

The therapeutic methods of the present invention can be applied in 
the treatment of a variety of conditions, including in the treatment of non- 
Hodgkin's lymphoma and in the treatment of atopic disorders or other 

25 allergenic diseases. Application of the present inventive therapeutic 
methods are evidenced by the current U.S. Food and Drug Administration 
approved use of antibodies against CD20 in the treatment of non-Hodgkin's 
lymphoma. Additionally, the therapeutic methods of the present invention 
are illustrated in view of the recognition in the art that genetic variations at 

30 chromosome 11Q12-13 can also play a role in the pathogenesis of atopic 
disorders and other allergenic diseases. Indeed, it has been recognized that 
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FcsRip contributes to such diseases, and thus the MS4A genes identified in 
accordance with the present invention are envisioned also to contribute to 
allergenic disease. Therefore the present therapeutic methods, which 
pertain to the modulation of the biological activity of an MS4A polypeptide of 
5 the present invention have application with respect to the treatment of such 
disorders. 

X. Summary 

The invention comprises 19 new genes that are members of a class 
10 of genes encoding MS4A proteins. Three members have been described, 
CD20, FceRIp, and HTm4. A gene family has been defined based on a 
shared chromosomal location, conservation of protein size and structure, 
gene structure conservation, and similar expression in hematopoietic cells. 
MS4A proteins function as oligomeric cell surface complexes, and complex 
15 assembly using diverse MS4A members is implicated as a mechanism for 
regulating complex function. 

Two members of this class, CD20 and FcsRip, have been described 
functionally, and in each case an important function has been delineated. 
CD20 is required for cell cycle progression and signal transduction in B 
20 lymphocytes. CD20 also regulates Ca ++ conductance, possibly as a cation 
channel subunit. Of clinical relevance, antibodies that recognize CD20 are 
effective in treating non-Hodgkin's lymphoma. FceRIp mediates interactions 
with IgE-bound antigens that lead to degranulation of mast cells, and 
variation of the FcsRip locus is implicated in allergenic disease. 

25 The utility of the MS4A genes is based in part on overlapping or 

shared functions with known MS4A members. In one case, new MS4A 
genes have important potential as part of a CD20 complex. The structural 
description of CD20 complexes suggests that one or more CD20-related 
proteins constitute the functional complex. Thus, new MS4A proteins can 

30 define antigens useful for lymphoma treatment. In another case, MS4A 
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genes are implicated in IgE responses. Atopic disorders (allergy, asthma, 
eczema, allergic rhinitis) are dysfunctional IgE responses and are associated 
with a iocus on human chromosome 1 1 q containing most members of the 
MS4A gene family. FcsRip is one relevant factor, and recent work supports 
5 that FceRip as well as other genetic elements in the region contribute to the 
disease. Thus, as disclosed herein, the present MS4A sequences also have 
utility in the characterization, diagnosis, and potential treatment of atopy 
linked to the chromosomal location wherein MS4A genes are located. 



10 Examples 

The following Examples have been included to illustrate modes of the 
invention. Certain aspects of the following Examples are described in terms 
of techniques and procedures found or contemplated by the present co- 
inventors to work well in the practice of the invention. These Examples 
15 illustrate standard laboratory practices of the co-inventors. In light of the 
present disclosure and the general level of skill in the art, those of skill will 
appreciate that the following Examples are intended to be exemplary only 
and that numerous changes, modifications, and alterations can be employed 
without departing from the scope of the invention. 

20 Example 1 

Database Searches and cDNA Isolation 

Three hundred and thirty seven nucleotide sequences obtained from 
the translated GenBank database of expressed sequence tags (ESTs) were 
assembled into sixty-two subgroups of contiguous linear segments based on 

25 their overlapping sequences and potential for encoding proteins homologous 
with CD20. Based on these subgroups, EST cDNAs (Figure 1) were 
obtained from the ATCC and sequenced. Based on the complete 
sequences of twenty-one near full-length EST cDNAs, eleven novel genes 
were defined in human and mouse that unified multiple EST subgroups. 

30 Near full-length EST clones representing these genes are shown in Figure 1. 
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These eleven genes and five additional genes were also identified by PCR 
amplification of transcripts using subgroup-specific primers or primers based 
on EST sequences. The specific details of how cDNAs representing the five 
genes that were not identified by EST cDNA clones are indicated below. In 
5 all cases, ESTs and cDNAs encoding the predicted coding regions of each 
putative unique gene were sequenced in both directions and at least two 
independent ESTs and/or cDNAs representing near full-length gene 
products were sequenced. Thereby, there was independent confirmation of 
accuracy for all of the sequences reported. 

10 Based on EST subgroup sequences, cDNAs encoding mouse 

MS4a4B and MS4a4C were isolated by PCR amplification of C57BL/6 
mouse spleen cDNA using both Taq and Pfu DNA polymerase. Primers for 
MS4a4B (SEQ ID NOs:63-64) amplified an 879 bp fragment. Primers for 
MS4a4C (SEQ ID NOs:65-66) amplified a 794 bp fragment. EST sequences 

15 for MS4a4D only encoded the 3' end of the predicted protein. Since 
MS4a4D sequences were closely related to MS4a4B and MS4a4C 
sequences, a sense 5' primer (SEQ ID NO:67) based on consensus 
MS4a4B and MS4a4C sequences and a MS4a4D-specific antisense primer 
(SEQ ID NO:68) were used to amplify a 773 bp fragment from cDNA of 

20 C57BL/6 mouse lung. 

MS4a6C was initially identified based on one unique EST sequence 
(AA028258) encoding a mouse protein homologous with the C-terminal end 
of MS4a6B. MS4a6C cDNAs were isolated by PCR amplification of 
C57BL/6 mouse bone marrow cDNA using Taq polymerase. A primer based 

25 on identical sequences at the 5' end of the MS4a6B and MS4a6D cDNAs 
(SEQ ID NO:69) was used in combination with an antisense primer specific 
for the unique EST sequence (SEQ ID NO:70) to amplify a 787 bp fragment. 
Sequences from multiple independent PCR-amplified cDNAs were identical. 
Subsequently, the PCR-generated 5* end of the near full-length MS4a6C 

30 cDNA was found to be identical to an orphan EST subgroup sequence that 
had not been linked with defined 3' sequences. Thereby, the EST subgroup 
sequences verified that the PCR-amplified 5' end of the MS4a6C cDNAs 
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was appropriate. In addition, the overall MS4a6C sequence was similar to 
the sequence of MS4a6B cDNAs without interruption. Thus, the MS4a6C 
cDNA united sequences identical to those found in two non-overlapping 
CD20-homologous EST subgroups. cDNAs encoding a 473 bp fragment of 
5 mouse MS4a3 were amplified from cDNA of C57BL/6 bone marrow as 
described above. Primers (SEQ ID NOs:71-72) were obtained based on a 
single thymic cDNA EST sequence (GenBank AA940479) where the 
corresponding cDNA was not available. 

Human MS4A and mouse MS4a cDNA sequences (MS4A1 to 
10 MS4A12) (disclosed herein) were used to search the htgs GenBank human 
genomic database of unfinished human genomic sequences 
(http://www.ncbi.nlm.nih.gov/blast/) using the BLAST program. Seventeen 
phase 1 or phase 2 human genomic DNA sequences encoding potential 
MS4A genes were assembled into groups of contiguous linear segments 
15 based on their overlapping sequences. Three EST clones corresponding to 
partial MS4A6E transcripts were obtained from the ATCC and sequenced 
completely on both DNA stands. 

All PCR-amplified cDNAs were subcloned and sequenced entirely in 
both directions. Complete sequencing of at least two distinct PCR- 

20 generated cDNAs from both Taq and Pfu enzyme was performed in most 
cases. Differences between cDNA sequences were only noted when 
multiple cDNA clones generated by both Taq and Pfu polymerases revealed 
identical differences. In some cases, cDNAs or EST sequences contained 
potential intronjexon splice sites that delimited structural domains and 

25 aligned with the known intron|exon splice sites of CD20 (Tedder et al. 
(1989b) J Immunol 142:2560-2568). In these cases, potential introns were 
flanked by consensus splice donor and/or splice acceptor sequences (Aebi 
& Weissmann (1987) Trends Genet 3:102-107) or were likely to represent 
splice variants where exons were deleted. 



WO 02/062946 



80 



PCT/US01/48437 



Example 2 

RNA Isolation and Reverse Transcription-PCR 

Reverse transcription-PCR amplification (RT-PCR) was as described 
previously (Zhou & Tedder, 1995) with minor modifications. Total RNA was 
5 extracted from 1-2 x 10 7 hematopoietic cell lines using a RNeasy Mini Kit 
(Qiagen, Inc., Chatsworth, California) according to the manufacturer's 
instructions. Human hematopoietic cell lines included one pre-B cell line 
(NALM-6), three B cell lines (BJAB, DAUDI, and SB), four T cell lines (HSB- 
2, HUT-78, JURKAT, and MOLT15), two myelomonocytic lines (HL60 and 

10 U937), and one erythroleukemia cell line (K562). RNA concentrations were 
determined by UV absorbance. Ten [ig of total RNA was reverse 
transcribed. In some cases, cDNA from any of 8 different human tissues 
(colon, ovary, blood mononuclear cells, prostate, small intestine, spleen, 
testes, and thymus; from CLONETECH Laboratories, Inc., Palo Alto, 

15 California) was analyzed. RT-PCR amplification was performed using gene- 
specific primers identical with protein coding regions of the predicted MS4A 
genes during 35 cycles (94°C for 1 min, 55°C for 1.5 min, 72°C for 1.5 min, 
followed by extension at 72°C for 5 min). Following amplification, the PCR 
products were separated on 1% agarose-ethidium bromide gels and 

20 photographed. G3PDH, a housekeeping gene, was also amplified to control 
for sample to sample variation. RNA amplified without reverse transcription 
was used as a negative control, and was negative in all cases. 

Example 3 

Recombinant Production of MS4A Protein 

25 For recombinant production of a protein of the invention in a host 

organism, a nucleotide sequence encoding the protein is inserted into an 
expression cassette designed for the chosen host and introduced into the 
host where it is recombinantly produced. The choice of the specific 
regulatory sequences such as promoter, signal sequence, 5' and 3' 

30 untranslated sequence, and enhancer appropriate for the chosen host is 
within the level of ordinary skill in the art. The resultant molecule, containing 
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the individual elements linking in the proper reading frame, is inserted into a 
vector capable of being transformed into the host cell. Suitable expression 
vectors and methods for recombinant production of proteins are well known 
for host organisms such as E. coli, yeast, and insect cells (see, e.g., 
5 Lucknow & Summers (1988) Bio/Technol 6:47). Additional suitable 
expression vectors are baculovirus expression vectors, e.g., those derived 
from the genome of Autographies californica nuclear polyhedrosis virus 
(AcMNPV). 

Recombinantly produced proteins are isolated and purified using a 
10 variety of standard techniques. The actual techniques used varies 
depending upon the host organism used, whether the protein is designed for 
secretion, and other such factors. Such techniques are well known to the 
skilled artisan. See Ausubel et al. (1994). 

Example 4 

15 Mouse Anti-Mouse CD20 Monoclonal Antibody Production 

Hybridomas producing CD20-specific mouse monoclonal antibodies 
were generated by the fusion of NS-1 myeloma cells with spleen cells from a 
CD20~ / " mouse immunized with a cell line expressing a mouse CD20-GFP 
fusion protein. The CD20-GFP fusion protein was generated by subcloning 
a fragment of the pmB1-1 cDNA (from 159 to 1050 bp of SEQ ID NO:39) into 
the PEGFP-N1 vector (Clonetech Laboratories Inc., Palo Alto, California) to 
generate an open reading frame encoding the entire CD20 protein with GFP 
fused to the carboxyl-terminal end. The resulting plasmid was linearized 
with ApaL I and used to transfect 300.19 cells, a mouse pre-B cell line, and 
Chinese Hamster Ovary (CHO) cells. Transfection was by Lipofectamine 
following the manufacturer's instructions (Clonetech Laboratories, Inc.). 
Transfected cells were selected using GENETICIN™ (1 mg/ml, GIBCOBRL) 
in RPMI 1640 media (Sigma) for 300.19 cells or H-12 nutrient mixture 
(GIBCOBRL) for CHO cells. Both media were supplemented with 10% FCS, 
L-glutamine, streptomycin and penicillin. Transfected cells expressing high 
levels of CD20-GFP were isolated by fluorescence-based cell sorting. 



20 



25 



30 
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Example 5 

In vitro Binding Assays 

Recombinant protein can be obtained, for example, according to the 
approach described in Example 4 herein above. The protein is immobilized 
5 on chips appropriate for tigand binding assays. The protein immobilized on 
the chip is exposed to sample compound in solution according to methods 
well known in the art. While the sample compound is in contact with the 
immobilized protein, measurements capable of detecting protein-ligand 
interactions are conducted. Measurement techniques include, but are not 
10 limited to, SEDLI, Biacore, and FCS, as described above. Compounds 
found to bind the protein are readily discovered in this approach and are 
subjected to further characterization. 

Example 6 

Generation of CD20-Deficient Mice 

15 DNA encoding the CD20 gene was isolated from a phage library 

prepared from 129/Sv strain mouse DNA (Figure 10A), mapped with 
restriction endonucleases, and sequenced to identify intron|exon boundaries 
(Figure 10B). The targeting vector was constructed using a pBluescript SK 
(Stratagene, La Jolla, California)-based targeting vector (p594, provided by 

20 Dr. David Milstone, Brigham and Women's Hospital, Boston, 
Massachusetts). A DNA fragment starting at the Pst I site in CD20 exon 5 
through the EcoR V site in exon 6 (-1.8 kb) was isolated and blunt end 
ligated into the targeting vector downstream of the pMC1-HSV thymidine 
kinase gene and upstream of the neomycin resistance marker obtained from 

25 pGK-neo poly A (Stratagene) that contained the PGK promoter and poly A 
signal sequence. An -10 kb DNA fragment beginning at the Kpn I site 
downstream of exon 8 was also isolated and inserted into the targeting 
vector downstream of the neomycin resistant gene. The plasmid was 
linearized using a unique Sal I restriction site proximal to the 3' end of the 

30 CD20 gene insert and used to transfect ES cells. 
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ES ceils were transfected with linearized plasmid DNA and selected 
for G418 resistance as described (Keller and Smithies (1989) Proc Natl 
Acad Sci USA 886:8932). Genomic DNA from individual selected clones 
was digested with EcoR V and used for Southern blot analysis along with a 
5 radiolabeled -1.5 kb DNA probe that was external to the targeting vector 
(Figure 10D). A 4.6 kb genomic DNA fragment hybridized with the probe in 
wild type ES cells or a 6.3 kb fragment in appropriately targeted ES cells (Fig 
1E). Genomic DNA generated by BamH I, Ssc I or Kpn I digestion was also 
analyzed for appropriate targeting. The Southern blot pattern obtained in all 

10 cases was consistent with the appropriate predicted mutation indicating that 
detrimental recombinations did not occur in the vicinity of the desired 
homologous recombination. Cells from appropriately targeted ES cell clones 
were injected into 3.5 day old C57BL/6 blastocysts that were transferred into 
foster mothers. Offspring carrying the mutant CD20 allele were identified by 

15 Southern blot analysis of DNA obtained from tail biopsies. 

High chimeric males (80-100% according to color) were bred with 
C57BL/6 (B6) females to generate heterozygous offspring with germline 
gene transmission, which were crossed to generate the homozygous CD20" A 
and wild type littermates used for this study. In some cases, B6/129F1J 

20 (Jackson Laboratory) were used as controls. Results obtained using wild 
type littermates of CD20 +/ * mice were similar and were therefore pooled. All 
mice were between 2-3 months of age when used for this study. Mice were 
housed in a specific pathogen-free barrier-facility. All studies and 
procedures were approved by the Animal Care and Use Committee of Duke 

25 University. 

Example 7 

Flow Cytometric Analysis of Lymphocyte Subsets 

Single cell suspensions of lymphocytes from the spleen, bone 
marrow, peripheral lymph nodes, and peritoneal cavity were isolated from 
30 CD20" A and wild type mice and counted using a hemocytometer prior to two- 
color immunofluorescence analysis. Retroorbital venous plexus puncture 
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was utilized to obtain circulating leukocytes. Leukocytes (0.5 x 10 6 ) were 
stained at 4°C using predetermined optimal concentrations of the test 
monoclonal antibody for 20 min as described (Zhou et al. (1994) Mol Cell 
Biol 14:3884-3894). Blood erythrocytes were lysed after staining using the 
5 Coulter Whole Blood Immuno-Lyse kit as detailed by the manufacturer 
(Coulter, Inc., Miami, Florida). Cells were washed and analyzed on a 
FACScan flow cytometer (Becton Dickinson, San Jose, California). 

Antibodies used in this study included the following: biotin, FITC- 
conjugated anti-B220 Mab (CD45RA, RA-3, 6B2, provided by Dr. Robert 

10 Coffman, DNAXCORP, Palo, Alto, California); PE-conjugated anti-mouse 
Thy1.2 (Caltag Laboratories, Burlingame, California); B220-PE (Caltag 
Laboratories, Burlingame, California); biotin-conjugated anti-l-A (BD 
PharMingen, Franklin Lakes, New Jersey); PE or APC-conjugated anti-CD5 
(BD PharMingen); PE-conjugated goat anti-mouse lgG3-specific antibody 

15 (Southern Biotechnology Associates Inc., Birmingham, Alabama); and biotin- 
conjugated anti-mouse IgD (Southern Biotechnology Associates Inc., 
Birmingham, Alabama). FITC or biotin-conjugated goat anti-mouse IgM 
isotype-specific antibodies (Southern Biotechnology Associates Inc., 
Birmingham, Alabama) were also used. 

20 Phycoerythrin-conjugated Streptavidin (Southern Biotechnology 

Associates Inc., Birmingham, Alabama) was used to reveal biotin-coupled 
monoclonal antibody staining. The percent positively stained lymphocytes 
was determined using a FACScan flow cytometer (Becton Dickinson, San 
Jose, California). Positive and negative populations of cells were 

25 determined by using unreactive monoclonal antibody (Caltag Laboratories, 
Burlingame, California) as controls for background staining. Background 
levels of staining were delineated using gates positioned to include 98% of 
the control cells. Ten thousand cells with the forward and side light scatter 
properties of lymphocytes were analyzed for each sample. 



WO 02/062946 



85 



PCT/US01/48437 



Example 8 

Intracellular Calcium Measurements 

Changes in lymphocyte [Ca 2+ } levels were monitored by flow 
cytometry analysis as described (Fujimoto et al. (1999) Immunity 11:191). 
5 Single cell suspension of splenocytes were resuspended (1 x 10 7 /ml) in 
RPMI 1640 medium containing 5% FBS, 10 mM HEPES and loaded with 
1 |nM of indo-1-AM for 30 min at 37°C. Splenocytes were then washed and 
incubated with a predetermined optimal concentration of FITC-conjugated 
anti-B220 monoclonal antibody for 15 min at room temperature. The 

10 splenocytes were washed again and resuspended at 2 x 10 6 /ml in medium. 
The fluorescence ratio (405/525 nm) of B220 + splenic B cells was monitored 
by flow cytometry at baseline for 1 min and for 6 min after stimulation with 
optimal and suboptimal concentrations of goat F(ab')2 anti-IgM antibody (5- 
40 |j,g/ml), optimal concentrations of anti-mouse CD19 monoclonal antibody 

15 (40 fag/ml), Thapsigargin (1 jig/ml; Sigma), or lonomycin (2.67 jag/ml; 
Calbiochem Biosciences, Inc., La Jolla, California). In some cases, EGTA (5 
mM final; pH 7.0) was added to the cells, immediately followed by 
stimulation with the inducing agents described above. Results were plotted 
as the fluorescence ratio at 20 sec intervals with background fluorescence 

20 subtracted. An increase in the fluorescence ratio indicates an increase in 
[Ca 2+ ]i. 

Example 9 

Characterization of a MS4A Promoter Region 
A preferred in vitro technique for evaluating MS4A promoter function 
25 is a transient transfection assay. According to this method, one or more 
chimeric reporter genes comprising a MS4A promoter region is introduced 
into a relevant host cell (e.g., a hematopoietic cell), and the resulting level of 
reporter gene expression is quantitated. Representative methods for making 
an expression system comprising a promoter region operably linked to a 
30 heterologous reporter sequence are disclosed in U.S. Patent No. 6,087,1 11. 

To analyze the function of a MS4A promoter region in vivo, transgenic 
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mice bearing a chimeric gene comprising a MS4A promoter region are 
generated, and a level of reporter gene expression in each mouse is 
determined. 

Within a candidate promoter region or response element, the 
5 presence of regulatory proteins bound to a nucleic acid sequence can be 
detected using a variety of methods well known to those skilled in the art 
(Ausubel et al., 1992). Briefly, in vivo footprinting assays demonstrate 
protection of DNA sequences from chemical and enzymatic modification 
within living or permeabilized cells. Similarly, in vitro footprinting assays 

10 show protection of DNA sequences from chemical or enzymatic modification 
using protein extracts. Nitrocellulose filter-binding assays and gel 
electrophoresis mobility shift assays (EMSAs) track the presence of 
radiolabeled regulatory DNA elements based on provision of candidate 
transcription factors. Computer analysis programs, for example TFSEARCH 

15 version 1.3 (Yutaka Akiyama: "TFSEARCH: Searching Transcription Factor 
Binding Sites", http://www.rwcp.or.jp/papia/), can also be used to locate 
consensus sequences of known cis-regulatory elements within a genomic 
region. 



20 
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CLAIMS 

What is claimed is: 

1 . An isolated MS4A polypeptide, or functional portion thereof, 
comprising: 

5 (a) a polypeptide encoded by the nucleotide sequence of 

any one of the odd-numbered SEQ ID NOs:1-37; 

(b) a polypeptide encoded by a nucleic acid molecule that is 
substantially identical to any one of the odd-numbered 
SEQIDNOs:1-37; 

10 (c) a polypeptide having the amino acid sequence of any 

one of the even-numbered SEQ ID NOs:2-38; 

(d) a polypeptide that is a biological equivalent of the 
polypeptide of any one the even-numbered SEQ ID 
NOs:2-38; or 

15 (e) a polypeptide which is immunologically cross-reactive 

with an antibody that shows specific binding with a 
polypeptide of any one of the even-numbered SEQ ID 
NOs:2-38. 

2. An isolated nucleic acid molecule encoding a MS4A 
20 polypeptide, comprising: 

(a) the nucleotide sequence of any one of the odd- 
numbered SEQ ID NOs:1-37; or 

(b) a nucleic acid molecule substantially identical to any 
one of the odd-numbered SEQ ID NOs:1-37. 

25 3. The isolated nucleic acid molecule of claim 2, comprising a 20 

nucleotide sequence that is identical to a contiguous 20 nucleotide sequence 
of any one of the odd-numbered SEQ ID NOs:1-37. 

4. A chimeric gene, comprising the nucleic acid molecule of claim 
2 operably linked to a heterologous promoter. 



WO 02/062946 



93 



PCT/US01/48437 



5. A vector comprising the chimeric gene of claim 4. 

6. A host cell comprising the chimeric gene of claim 4. 

7. The host cell of claim 6, wherein the cell is selected from the 
group consisting of a bacterial cell, a hamster cell, a mouse cell, and a 

5 human cell. 

8. A method of detecting a nucleic acid molecule that encodes a 
MS4A polypeptide, the method comprising: 

(a) procuring a biological sample comprising nucleic acid 
material; 

10 (b) hybridizing the nucleic acid molecule of claim 2 under 

stringent hybridization conditions to the biological 
sample of (a), thereby forming a duplex structure 
between the nucleic acid of claim 2 and a nucleic acid 
within the biological sample; and 

15 (c) detecting the duplex structure of (b), whereby a MS4A 

nucleic acid molecule is detected. 

9. An antibody that specifically recognizes a MS4A polypeptide of 
claim 1. 

10. A method for producing an antibody that specifically 
20 recognizes a MS4A polypeptide, the method comprising: 

(a) recombinantly or synthetically producing a MS4A 
polypeptide, or portion thereof; 

(b) formulating the polypeptide of (a) whereby it is an 
effective immunogen; 

25 (c) administering to an animal the formulation of (b) to 

generate an immune response in the animal comprising 
production of antibodies, wherein antibodies are present 
in the blood serum of the animal; and 
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(d) collecting the blood serum from the animal of (c), the 
blood serum comprising antibodies that specifically 
recognize a MS4A polypeptide. 

11. A method for detecting a level of MS4A polypeptide, the 
5 method comprising 

(a) obtaining a biological sample comprising peptidic 
material; and 

(b) detecting a MS4A polypeptide in the biological sample 
of (a) by immunochemical reaction with the antibody of 

10 claim 9, whereby an amount of MS4A polypeptide in a 

sample is determined. 

12. A method for identifying a substance that modulates MS4A 
function, the method comprising: 

(a) isolating a MS4A polypeptide of claim 1 ; 

15 (b) exposing the isolated MS4A polypeptide to a plurality of 

substances; 

(c) assaying binding of a substance to the isolated MS4A 
polypeptide; and 

(d) selecting a substance that demonstrates specific 
20 binding to the isolated MS4A polypeptide. 

13. A method for modulating MS4A function in a subject, the 
method comprising: 

(a) preparing a pharmaceutical composition, comprising a 
substance identified according to the method of claim 10 

25 or 12, and a carrier; and 

(b) administering an effective dose of the pharmaceutical 
composition to a subject, whereby MS4A activity is 
altered in the subject. 
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14. The method of claim 13, wherein the substance is an antibody, 
a protein, a peptide, or a chemical compound. 

15. The method of claim 13, wherein MS4A activity is regulation of 
the abundance of target cell subpopulations. 

5 16. The method of claim 13, wherein MS4A activity is regulation of 

[Ca 2 l levels. 

17. A method for identifying a candidate compound as a modulator 
of MS4A gene expression, the method comprising: 

(a) exposing a cell sample with a candidate compound to be 
10 tested, the cell sample containing at least one cell containing a 

DNA construct comprising a modulatable transcriptional 
regulatory sequence of a MS4A-encoding nucleic acid and a 
reporter gene which is capable of producing a detectable 
signal; 

15 (b) evaluating an amount of signal produced in relation to a control 

sample; and 

(c) identifying a candidate compound as a modulator of MS4A 
gene expression based on the amount of signal produced in 
relation to a control sample. 

20 18. The method of claim 17, wherein the modulatable 

transcriptional regulatory sequence of a MS4A-encoding nucleic acid 
comprises a sequence that is immediately upstream of the initial coding 
region of a MS4A gene as set forth in any one of SEQ ID NOs:73-81 . 

19. A method for modulating MS4A function in a subject, the 
25 method comprising: 

(a) preparing a gene therapy vector having a nucleotide 
sequence encoding a MS4A polypeptide or a nucleotide 
sequence encoding a nucleic acid molecule, peptide, or 
protein that interacts with a MS4A nucleic acid or 
30 polypeptide; and 
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administering the gene therapy vector to a subject, 
whereby the function of MS4A in the subject is 
modulated. 
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Figure 3 
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A6E MTSQPI SNETI IM 

A6E TT GGC AAC ACC ATT ATG ACA TCA CAA CCT ATT TCC AAT GAG ACC ATC ATA ATG 53 

A6A TT GGC AAC ACC ATC ATG ACA TCA CAA CCT GTT CCC AAT GAG ACC ATC ATA GTG 277 

A6E L PSNVINF SQAE K PE PTNQG 

A6E CTC CCA TCA AAT GTC ATC AAC TTC TCC CAA GCA GAG AAA CCC GAA CCC ACC AAC CAG GGG 113 

A6A CTC CCA TCA AAT GTC ATC AAC TTC TCC CAA GCA GAG AAA CCC GAA CCC ACC AAC CAG GGG 3 37 

AGE QDSLKKRLQAKVKVIG 

A6E CAG GAT AGC CTG AAG AAA CGT CTA CAG GCA AAA GTC AAA GTT ATT GGG | 161 

A6A CAG GAT AGC CTG AAG AAA CAT CTA CAC GCA GAA ATT AAA GTT ATT GGG | (exons MS4A6A) 397 

A6E VHSSLAGSILSALSALVGFI 

A6E GTG CAT AGC AGC CTG GCT GGA AGC ATT CTG AGT GCT CTG TCT GCC CTG GTG GGT TTC ATT 221 

A6A GTG CAT AGC AGC CTG GTT GGA AGC ATT CTG AGT GCT CTG TCT GCC CTG GTG GGT TTC ATT 637 

A6E LLSVNPAALNPASLQCKLDE 

A6E CTC CTG TCT GTC AAC CCG GCT GCA TTA AAT CCT GCC TCA TTG CAG TGT AAG TTG GAC GAA 281 

A6A ATC CTG TCT GTC AAA CAG GCC ACC TTA AAT CCT GCC TCA CTG CAG TGT GAG TTG GAC AAA 697 

A6E KDIPTRLLLSYDYH SPYTM 

A6E AAG GAT ATA CCA ACC AGA CTT CTT CTT TCT TAT GAT TAT CAT TCA CCT TAC ACC ATG 338 

A6A AAT AAT ATA CCA ACA AGA AGT TAT GTT TCT TAC TTT TAT CAT GAT TCA CTT TAT ACC ACG 757 

A6E DCHRAKASLAGTLSL-MLVST 

A6E GAC TGC CAT AGA GCC AAA GCC AGT CTG GCT 1 GGA ACT CTG TCT CTG ATG CTG GTT TCT ACT 3 98 

A6A GAC TGC TAT ACA GCC AAA GCC AGT CTG GCT J GGA ACT CTC TCT CTG ATG CTG ATT TGC ACT 817 

A6E VLEFCLAV LTAVLQWKQTV * 

A6E GTG TTG GAG TTC TGC CTA GCT GTG ' CTC ACT GCT GTG CTG CAG TGG AAA CAG ACT GTC TGA 4 58 

A6A CTG CTG GAA TTC TGC CTA GCT GTG CTC ACT GCT GTG CTG CGG TGG AAA CAG GCT TAC TCT 877 

A6E C TTC CCT GG | TTA CTC ATG TCC 479 

A6A GAC TTC CCT GGG | AGT GTA CTT TTC CTG CCT CAC AGT TAC ATT GGT AAT TCT GGC ATG TCC 937 

A6E TCA AAG ACG ACT CAT GAT GCT GGA TAT GAA GAA CTA TTG ACT TCT TGG GAA AAA ACG GAG 53 9 

A6A TCA AAA ATG ACT CAT GAC TGT GGA TAT GAA GAA CTA TTG ACT TCT TAA GAA AAA AGG GAG 997 

A6E AAA TAT TAA TTG GAA AGT TGA TTT TTA TGA TAA TAT GGA AAA CCT AAC CAT TAT AAA AAA 599 

A6A AAA TAT TAA TCA GAA AGT TGA TTC TTA TGA TAA TAT GGA AAA GTT AAC CAT TAT AGA AAA 1057 

A6E GCA AA CTT GAG TTT CCT AAA TGT AAG CAT TTA AAG TAA ATG CAT ATT TGT TTT AAA AAA 658 

A6A GCA AAG CTT GAG TTT CCT AAA TGT AAG CTT TTA AAG TAA TG AAC ATT AA AAA AAA CCA 1115 

A6E TTA 66 1 

A6A TTA TTT CAC TGT CA 1129 
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A2 MDTESNRRANLALP QEPS | SVPAFEVLEI SPQEVSSGRLLKSASSPPLHTWLTVLKKEQEFLG 

a2 MDTENRSRADL ALPN PQES S SAPDI ELLEASP AKAAPPKQTWRTFLKKELEFLG 

A3 MASHEVDNAELGSASAHGTPGSETGPEELNTSVYHP I NGS PDYQKAKLQVLG 

33 MKPEETGGSVYQPLDESRHVQRGVLQALG 

A4 A MTTMQGMEQAMPGAGPGVPQLGNMAVIHSHLWKGLQEKFLKGEPKVLG 

a4B MQGQEQTTMAWPGVAVPSKNSVMTSQMWNEKKEKFLKGEPKVLG 

a4 C MQGQEQTTMAWPGGAPPSENSVMKSQMWNENKEKFLKGEPKVLG 

a4D MQGLAQTTMAWPGGAPPSENSVI KSQMWNKNKEKFLKGEPKVLG 

A4 E MTTMQGM EQTT PG PGPDVPQLGN I DVIHS YLCKGLQEKFFKRKPKVLG 

A5 MDSSTAHSPVFLVFPPEITASEYESTELSATTFSTQSPLQKLFARKMKILG 

A6A MTSQPVPNETI I VLPSNVI NFSQAEKPEPTNQGQDSLKKHLHAEI KVIG 

a6B MI PQVVTSEWAMISPNGMSLPQTDKPQPraQWQDSLKI^LKAEIKVMA 

36 C MI PQWTNET ITT IS PNG I N FPQKD ESQPTQQRQDS LKKHLKAE IKVIV 

a6D MI PQWTSETVTVI SPNGI SFPQTDKPQPSHQSQDSLKKHLKAEI KVMA 

A6E MTSQPISNETI IMLPSNVINFSQAEKPEPTNQGQDSLKKRLQAKVKVIG 

A7 MLLQSQTMGVSHSFTPKGITIPQREKPGHMYQNEDYLQNGLPTETTVLG 

a7 MRLQLGTKNIGWDCFPKDI I IHKREKTGHTYEKEDDLLIGVPSEATLLG 

A8B MNSMTSAVPVANSVLWAPHNGYPVTPGIMSHVPLYPNSQPQVHLVPGNPPSLVSNVNGQPVQKALKEGKTLG 
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al2 MMSSKPTTYPGVYGTTPDLYPPSNFMVPGSQQPPGFINPRIQVQSSQ APFI VSPGI PNNSQQVQGNI QMVNPGTGKAATNFKEEAKTLG 
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1 1 S SLTLN T I TSVLAATAS I MGWSVAVGS 



FIFSGSLSIAAGVKP TKAM I ISSLSVNTISSVLAVAASI IGVI SVISGV 



FIISVSLSVAAGIRT TKGL 
FINSGAFLIAVKRKT TETL 
FI I SGSLSI ATEKRL TKLL 
FIVSGILSIVTETKS TKIL 



VGGSLGKN I TSSVLAI SGILI NAI SLT FYS 
I I LSRIMNFLSALGAI AGI ILLTFGFIL 
VHSSLVGS I LSALSALVGFI I LSVKQATLNPASLQ 
VDSSLTLNI LSVSFAFMGI I I ISVSLAGLHPASEQ 
FIASGILSI ITERKS TKPL VDASLTLNI LSVSFAFVGI I I ISVSLAGLHPASEQ 
FAI SG I LS I VTEKKM TKPL VHSSLALSI LSVLSALTGI Al LSVSLAALEPALQQ 
I VHS SLAGS I LSALSALVGFI LLSVNPAALNPASLQ 
FGITGSLSI I SG KQSTKPF [ DLSSLTSNAVSSVTAGAGLFLLADSMVALR 
FAIAGILSl I SEKTS TKPF ALSSLASNVASSWAVIGLFLFTYCLIALG 
FI I SGSLSVAAENQPYSYCL | LSGSLGLN I VSAICSAVGVI LFI TDLS I PH 
FIISGSLSVAAETQPNSPCL LNG SVGLN I FS A I CS AVG IMLFITDISISS 
FLI SGI LAI TMKTFS KTYL | KMLCLMTNLI SLFCVLSGLFVI SKDLFLES 
FLVAGMAAMTTVTFP KTSL KVLCVIANVISLFCALAGFFVIAKDLFLEG 
FIISGSLSVSASKEL SRCL j VKGSLGMNIVSSILAFIGVILLLVDMCING 
FIITGILCILASKKS SPAL IKSSLGMS I VSSFFAFIGMILLLVDESING 



AY IHIHSCQKFFETK 171 
AYMNNCKNVTEDDG 162 
QSLRSCHSSSESPDL 162 

FHHPYCNYYGNSNN 154 
QFP 137 
QFP 137 
FR 136 

FRYH Y CNHDQ L SSN 154 
DQN Y I CGY SHQNSQ 155 
CELDKNNI PTRSYVSYFYHDSLYTTD 174 
CLQSKELRPTEYHYYQ FLDRNE 170 
CKQSKELSLI EHD YYQPFYNSDRSE 173 
CKLAFTQLDTTQDAYHFFSPEPLNS 173 
CKLDEKDIPTRLLLSYDYHSPYT MD 109 
TASQHCGSEMDYLSSLPYSEYYYPI YEIKD 173 
SAFPHCNSEKKFLSLLSYLKSHHWKNEDKN 170 
PYAYPDYYPY 174 
GYI YPSYYPYQE 192 
PFESPIWRMYPNST 164 
PFPWPIWRPYPEPT 161 
VAGQD 192 
LPEQD 189 



[ TM4 ] 

Al CYSIQSLFL | GILSVMLIFAFFQELVIAGIVENEWKRTCSRPKS 
al CNSIQSVFL G I LSAML I SAFFQKLVTAGI VENEWKRMCTRSKS 
A2 CFMASFS | T EIWMMLFLTILGLGSAVSLTICGAGEELKGNK) 
a2 C FVASFTT ELVLMMLFLTILAFCSAVLFTI YRIGQELESKK 
A3 CNYMGSISN GMVSLLL I LTLLELCVTI STI AMWCNANCCNSRE 
A4A CHGTMSILM I GLDGMVLLLSVLEFCIAVSLSAFGCKVLCCTPGG 
a4B FRYNYT I TK | GLDVLML I FNMLEFCLAVSVS AFGCEASCCNSRE 
a4C FRYNYT I TK GLDI LMLI LNMLEFCIAVSI SAFGCKASCCNSSE 
a4D QFRSQPAIA SLDVLMT I LNMLEFC I AVSV SAFGCKASCCNSSE 
A4E CYMTMSILM' 
A5 CKAVTVLFL 
A6A CYTAKASLA 



NIVLLSAEEKKEQTIEIKEEWGLTETSSQPKNEEDI EI I PIQEEEEEETETNFPEPPQDQESSPIENDSSP297 
NWLLSAGEKNEQTIKMKEEI IELSGVSSQPKNEEEIEI IPVQEEEEEEAEINFPAPPQEQESLPVENEIAP2 91 



VPEDRVYEELNI YSATYSELEDPGEMSPP I DL 
VPDDRLYEELNVYSPIYSELEDKGETSSPVDS 
EISSPPNSV 

WLI LP SHSHMAETAS PT P LNEV 

VLWLP SNPVETVMAP PMTLQP LL P S EHQGTNVPGNVYKNH PGE I V . 
VLVVLPSNPAVTVMAPPVTLOPLPPSEHQGKNVPENVYKNHSEE IV 
VLVVLPSNSAVTVTAPPMI LQPLPPSECQGKNVPENLYRNQPGEIV 
FVLITPSNSHMAEI ASPTPLKTV 



GTDGMVLLLSVLEFCI AVSLSAFGCKVLCCSPSE | 
GI LI TLMTFS I I ELFI SLPFSI LGCHSEDCDCEQCC 

GTLS LML ICTLLEFCLAVLTAVLRWKQAYSDFPG | SVLFLPHSY IGNSGMSSKMTHDCGYEELLTS 
a6B CFAAKSVLA GVFS LMLI STMLELGLAVLTAMLWWKQSHSNI PG NVMFLPHSSNNDSNMESKVLCNPSYEEQLVC 
a6C CAVTKSILT GALSVMLI I SVLELGLALLSAMLWLREGVLTSLRM 

a6D CFVAKAALT GVFSLMLISSVLELGLAVLTATLWWKQSSSAFSG NVI FLSQNSKNKSSVSSESLCNPTYENI LTS 
A6E CHRAKASLA J GTLS LMLVSTVLEFCLAVLTAVLQWKQTV 

A7 CLLTSVSLT | GVLWMLI FTVLELLLAAYSSVFWWKQLYSNNPG | SSFSSTQSQDHIQQVKKSSSRSWI 

a7 CYLAYVGAM SALGMMLLFTVLEVFLAGYSSI FWWKQVYSNKPG GTFFLPQSQDHTQLVKSNLLQ 

A8B AWGV j NPGMAI SGVLLVFCLLEFGI ACASSHFGCQLVCCQSSN | VSVIYP NI YAANPVITPEPVTSPPSYSSEIQANK 

a8B ENLGV RTGVAI SSVLLI FCLLELSI ASVSSHFGCQVACCHYNN PGWI P | NVYAANPWI PEPPNPIPSYSEWQDSR 

AlO | VHIQRLELALLCFTVLELFLPVPTAVTAWRGDCPSAK | NDDACLVPNTPLHLKGLPVEPPPSYQSVIQGDAQHKQHQR 

alO TYIQRLELTLFCFTFLEIFLSGSTAITAYPJ^KRLOAEDKDDTPFVPDTPMELKGLSLGPPPSYK^ 

A12 DYWAV | LSGKGI SATLM I FSLLEFFVACATAHFANQANTTTNM j SVLVI PNMYESNPVTPASSSAPPRCNNYSANAPK 

al2 DYWAV LSGKGI SAMLI I FSLLEFCI TCVTAYFASHTITNTRGL SWSFHLCMQTVP 



244 
235 
214 
220 
226 
226 
225 
220 
200 
248 
244 
217 
247 
147 
240 
234 
250 
268 
241 
267 
267 
243 
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Sequence Listing 



<110> Tedder, Thomas 
Liang, Yinghua 

<120> Identification of Novel MS4A Gene Family Members Expressed by 
Hematopoietic Cells 

<130> 180/132 

<160> 81 

<170> Patentln version 3.0 

<210> 1 

<211> 1619 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1619) 

<220> 

<221> CDS 

<222> (144) . . (806) 



<400> 1 

aacattcctg caaatggttt caatatatgc agatgtctcg atataggaat gaaattacgt 



60 



ctttggaaca acttaaataa gtcaaatata cttggagctt taaaaattaa aaggagagag 



120 



attcgagcac cttttctgct gcc atg aca acc atg caa gga atg gaa cag gcc 

Met Thr Thr Met Gin Gly Met Glu Gin Ala 
15 10 



173 
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atg cca ggg get ggc cct ggt gtg ccc cag ctg gga aac atg get gtc 221 
Met Pro Gly Ala Gly Pro Gly Val Pro Gin Leu Gly Asn Met Ala Val 
15 20 25 

ata cat tea cat ctg tgg aaa gga ttg caa gag aag ttc ttg aag gga 269 
lie His Ser His Leu Trp Lys Gly Leu Gin Glu Lys Phe Leu Lys Gly 
30 35 40 

gaa ccc aaa gtc ctt ggg gtt gtg cag att ctg act gee ctg atg age 317 
Glu Pro Lys Val Leu Gly Val Val Gin lie Leu Thr Ala Leu Met Ser 
45 50 55 

ctt age atg gga ata aca atg atg tgt atg gca tct aat act tat gga 365 
Leu Ser Met Gly lie Thr Met Met Cys Met Ala Ser Asn Thr Tyr Gly 
60 65 70 

agt aac cct att tec gtg tat ate ggg tac aca att tgg ggg tea gta 413 
Ser Asn Pro lie Ser Val Tyr lie Gly Tyr Thr lie Trp Gly Ser Val 
75 80 85 90 

atg ttt att att tea gga tec ttg tea att gca gca gga att aga act 461 
Met Phe lie He Ser Gly Ser Leu Ser He Ala Ala Gly He Arg Thr 
95 100 105 

aca aaa ggc ctg gtc cga ggt agt eta gga atg aat ate acc age tct 509 
Thr Lys Gly Leu Val Arg Gly Ser Leu Gly Met Asn He Thr Ser Ser 
110 115 120 

gta ctg get gca tea ggg ate tta ate aac aca ttt age ttg gcg ttt 557 
Val Leu Ala Ala Ser Gly He Leu He Asn Thr Phe Ser Leu Ala Phe 
125 130 135 

tat tea ttc cat cac cct tac tgt aac tac tat ggc aac tea aat aat 605 
Tyr Ser Phe His His Pro Tyr Cys Asn Tyr Tyr Gly Asn Ser Asn Asn 
140 145 150 

tgt cat ggg act atg tec ate tta atg ggt ctg gat ggc atg gtg etc 653 
Cys His Gly Thr Met Ser He Leu Met Gly Leu Asp Gly Met Val Leu 
155 160 • 165 170 

etc tta agt gtg ctg gaa ttc tgc att get gtg tec etc tct gee ttt 701 
Leu Leu Ser Val Leu Glu Phe Cys He Ala Val Ser Leu Ser Ala Phe 
175 180 185 

gga tgt aaa gtg etc tgt tgt acc cct ggt ggg gtt gtg tta att ctg 74 9 

Gly Cys Lys Val Leu Cys Cys Thr Pro Gly Gly Val Val Leu He Leu 
190 195 200 

cca tea cat tct cac atg gca gaa aca gca tct ccc aca cca ctt aat 797 
Pro Ser His Ser His Met Ala Glu Thr Ala Ser Pro Thr Pro Leu Asn 
205 210 215 

gag gtt tga ggccaccaaa agatcaacag acaaatgetc cagaaatcta 846 
Glu Val 
220 



tgctgactgt gacacaagag cctcacatga gaaattacca gtatccaact tcgatactga 906 
tagacttgtt gatattatta ttatatgtaa tccaattatg aactgtgtgt gtatagagag 966 
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ataataaatt 


caaaattatg 


ttctcatttt 


tttccctgga 


actcaataac 


tcatttcact 


ggctctttat 


cgagagtact 


agaagttaaa 


ttaataaata 


atgcatttaa 


tgaggcaaca 


gcacttgaaa 


gtttttcatt 


catcataaga 


actttatata 


aaggcattac 


attggcaaat 


aaggtttgga 


agcagaagag 


caaaaaaaag 


atattgttaa 


aatgaggcct 


ccatgcaaaa 


cacatacttc 


cctcccat tt 


atttaacttt 


ttttttctcc 


tacctatggg 


gaccaaagtg 


ctttttcctt 


caggaagtgg 


agatgcatgg 


ccatctcccc 


ctcccttttt 


ccttctcctg 


cttttctttc 


cccatagaaa 


gtaccttgaa 


gtagcacagt 


ccgtccttgc 


atgtgcacga 


gctatcattt 


gagtaaaagt 


atacatggag 


taaaaatcat 


attaagcatc 


agattcaact 


tatattttct 


atttcatctt 


cttcctttcc 


cttctcccac 


cttctactgg 


gcataattat 


atcttaatca 


tatatggaaa 


tgtgcaacat 


atggtatttg 


ttaaatacgt 


ttgtttttat 


tgcagagcaa 


aaataaatca 


aattagaagc 


aataaaaaaa 


aaaaaaaaaa 


aaa 



<210> 2 

<211> 220 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Thr Thr Met Gin Gly Met Glu Gin Ala Met Pro Gly Ala Gly Pro 
15 10 15 



Gly Val Pro Gin Leu Gly Asn Met Ala Val lie His Ser His Leu Trp 
20 25 30 



Lys Gly Leu Gin Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly 
35 40 45 



Val Val Gin lie Leu Thr Ala Leu Met Ser Leu Ser Met Gly He Thr 
50 55 60 



Met Met Cys Met Ala Ser Asn Thr Tyr Gly Ser Asn Pro He Ser Val 
65 70 75 80 



Tyr He Gly Tyr Thr He Trp Gly Ser Val Met Phe He He Ser Gly 
85 - 



Ser Leu Ser He Ala Ala Gly He Arg Thr Thr Lys Gly Leu Val Arg 
100 105 110 



1026 
1086 
1146 
1206 
1266 
1326 
1386 
1446 
1506 
1566 
1619 
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Gly Ser Leu Gly Met Asn lie Thr Ser Ser Val Leu Ala Ala Ser Gly 
115 120 125 



lie Leu lie Asn Thr Phe Ser Leu Ala Phe Tyr Ser Phe His His Pro 
130 135 140 



Tyr Cys Asn Tyr Tyr Gly Asn Ser Asn Asn Cys His Gly Thr Met Ser 
145 150 155 160 



lie Leu Met Gly Leu Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 
165 170 175 



Phe Cys lie Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 
180 185 190 



Cys Thr Pro Gly Gly Val Val Leu lie Leu Pro Ser His Ser His Met 
195 200 205 



Ala Glu Thr Ala Ser Pro Thr Pro Leu Asn Glu Val 
210 215 220 



<210> 3 

<211> 663 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (663) 



<400> 3 

atg aca acc atg caa gga atg gaa cag acc act cca ggg cct ggc cct 4 8 

Met Thr Thr Met Gin Gly Met Glu Gin Thr Thr Pro Gly Pro Gly Pro 
1 5 10 15 

gat gtg ccc cag ctg gga aac ata gat gtc ata cat tea tat ctg tgt 96 
Asp Val Pro Gin Leu Gly Asn lie Asp Val lie His Ser Tyr Leu Cys 
20 25 30 



aaa gga ttg caa gag aag ttc ttc aag agg aaa ccc aaa gtc ctt ggg 144 
Lys Gly Leu Gin Glu Lys Phe Phe Lys Arg Lys Pro Lys Val Leu Gly 
35 40 45 
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gtt gtg egg att ctg att gec ttg atg age ctt age atg gga ata ata 192 
Val Val Arg lie Leu lie Ala Leu Met Ser Leu Ser Met Gly lie lie 
50 55 60 

5 atg atg tgt gtt gca ttt agt tct tat gaa gaa cat ccc att ttt gtg 240 

Met Met Cys Val Ala Phe Ser Ser Tyr Glu Glu His Pro lie Phe Val 
65 70 75 80 

tat gtt gcg tac aca att tgg ggg tea gtg atg ttt att att tea gta 288 
10 Tyr Val Ala Tyr Thr He Trp Gly Ser Val Met Phe He He Ser Val 

85 90 95 

tec tta tea gtt gca gca gga att aga aca aca aaa ggt ctg gtt gga 33 6 

Ser Leu Ser Val Ala Ala Gly He Arg Thr Thr Lys Gly Leu Val Gly 
15 ' 100 105 110 

ggt agt eta gga aag aat ate acc agt tea gtc ttg get ata tea ggg 3 84 

Gly Ser Leu Gly Lys Asn He Thr Ser Ser Val Leu Ala He Ser Gly 
115 120 125 

20 

ate tta ate aat gca ata age ttg acg ttt tat tea ttc cgt tac cat 432 
He Leu He Asn Ala He Ser Leu Thr Phe Tyr Ser Phe Arg Tyr His 
130 135 140 

25 tac tgt aac cac gat cag ttg tea agt aat tgt tac atg act atg tec 480 

Tyr Cys Asn His Asp Gin Leu Ser Ser Asn Cys Tyr Met Thr Met Ser 
145 150 155 160 

att tta atg ggt acg gat ggc atg gtg etc etc tta agt gtg ctg gaa 528 
30 He Leu Met Gly Thr Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 

165 170 175 

ttc tgc att get gtg tec etc tct gee ttt gga tgt aaa gtg ctt tgt 576 
Phe Cys He Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 
35 180 185 190 

tgt age ccc agt gag ttt gtg eta att aca cca tea aat tct cac atg 624 
Cys Ser Pro Ser Glu Phe Val Leu He Thr Pro Ser Asn Ser His Met 
195 200 205 



40 



45 



55 



60 



gca gaa ata gca tct ccc aca cca ctt aag aca gtt tga 663 
Ala Glu He Ala Ser Pro Thr Pro Leu Lys Thr Val 
210 215 220 



<210> 4 

<211> 220 

50 <212> PRT 

<213> Homo sapiens 



<400> 4 

Met Thr Thr Met Gin Gly Met Glu Gin Thr Thr Pro Gly Pro Gly Pro 
15 10 15 



-5- 
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Asp Val Pro Gin Leu Gly Asn lie Asp Val lie His Ser Tyr Leu Cys 
20 ' 25 30 



Lys Gly Leu Gin Glu Lys Phe Phe Lys Arg Lys Pro Lys Val Leu Gly 
35 40 45 



Val Val Arg lie Leu lie Ala Leu Met Ser Leu Ser Met Gly lie lie 
50 55 60 



Met Met Cys Val Ala Phe Ser Ser Tyr Glu Glu His Pro lie Phe Val 
65 70 75 80 



Tyr Val Ala Tyr Thr lie Trp Gly Ser Val Met Phe lie lie Ser Val 
85 90 95 



Ser Leu Ser Val Ala Ala Gly lie Arg Thr Thr Lys Gly Leu Val Gly 
100 105 110 



Gly Ser Leu Gly Lys Asn lie Thr Ser Ser Val Leu Ala lie Ser Gly 
115 120 125 



lie Leu lie Asn Ala lie Ser Leu Thr Phe Tyr Ser Phe Arg Tyr His 
130 135 140 



Tyr Cys Asn His Asp Gin Leu Ser Ser Asn Cys Tyr Met Thr Met Ser 
145 150 155 160 



lie Leu Met Gly Thr Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 
165 170 175 



Phe Cys lie Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 
180 185 190 



Cys Ser Pro Ser Glu Phe Val Leu lie Thr Pro Ser Asn Ser His Met 
195 200 205 



Ala Glu lie Ala Ser Pro Thr Pro Leu Lys Thr Val 
210 215 220 



<210> 5 
<211> 694 
<212> DNA 



<213> Homo sapiens 
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<220> 

<221> cDNA 
<222> (1) . . (694) 



<220> 

<221> CDS 

<222> (55) . . (657) 



<400> 5 

ctagactgaa gtaccaacta aatcatctcc tttcaaatta tcaccgacac catc atg 57 

Met 
1 

gat tea age acc gca cac agt ccg gtg ttt ctg gta ttt cct cca gaa 105 
Asp Ser Ser Thr Ala His Ser Pro Val Phe Leu Val Phe Pro Pro Glu 
5 10 15 

ate act get tea gaa tat gag tec aca gaa ctt tea gcc acg acc ttt 153 
lie Thr Ala Ser Glu Tyr Glu Ser Thr Glu Leu Ser Ala Thr Thr Phe 
20 25 30 

tea act caa age ccc ttg caa aaa tta ttt get aga aaa atg aaa ate 201 
Ser Thr Gin Ser Pro Leu Gin Lys Leu Phe Ala Arg Lys Met Lys lie 
35 40 45 

tta ggg act ate cag ate ctg ttt gga att atg acc ttt tct ttt gga 249 
Leu Gly Thr lie Gin lie Leu Phe Gly lie Met Thr Phe Ser Phe Gly 
50 55 60 65 

gtt ate ttc ctt ttc act ttg tta aaa cca tat cca agg ttt ccc ttt 297 
Val lie Phe Leu Phe Thr Leu Leu Lys Pro Tyr Pro Arg Phe Pro Phe 
70 75 80 

ata ttt ctt tea gga tat cca ttc tgg ggc tct gtt ttg ttc att aat 345 
lie Phe Leu Ser Gly Tyr Pro Phe Trp Gly Ser Val Leu Phe lie Asn 
85 90 95 

tct gga gcc ttc eta att gca gtg aaa aga aaa acc aca gaa act ctg 393 
Ser Gly Ala Phe Leu lie Ala Val Lys Arg Lys Thr Thr Glu Thr Leu 
100 105 110 

ata ata ttg age cga ata atg aat ttt ctt agt gcc ctg gga gca ata 441 
lie lie Leu Ser Arg lie Met Asn Phe Leu Ser Ala Leu Gly Ala lie 
115 120 125 

get gga ate att etc etc aca ttt ggt ttc ate eta gat caa aac tac 489 
Ala Gly lie lie Leu Leu Thr Phe Gly Phe lie Leu Asp Gin Asn Tyr 
130 135 140 145 

att tgt ggt tat tct cac caa aat agt cag tgt aag get gtt act gtc 537 
lie Cys Gly Tyr Ser His Gin Asn Ser Gin Cys Lys Ala Val Thr Val 
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150 155 160 

ctg ttc ttg gga att ttg att aca ttg atg act ttc age att att gaa 585 
Leu Phe Leu Gly lie Leu lie Thr Leu Met Thr Phe Ser lie lie Glu 
165 170 175 

tta ttc att tct ctg cct ttc tea att ttg ggg tgc cac tea gag gat 633 
Leu Phe lie Ser Leu Pro Phe Ser lie Leu Gly Cys His Ser Glu Asp 
180 185 190 

tgt gat tgt gaa caa tgt tgt tga ctagcactgt gagaataaag atgtgttaaa 687 
Cys Asp Cys Glu Gin Cys Cys 
195 200 

ataaaaa 694 



<210> 6 

<211> 200 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Asp Ser Ser Thr Ala His Ser Pro Val Phe Leu Val Phe Pro Pro 
15 10 15 



Glu lie Thr Ala Ser Glu Tyr Glu Ser Thr Glu Leu Ser Ala Thr Thr 
20 25 30 



Phe Ser Thr Gin Ser Pro Leu Gin Lys Leu Phe Ala Arg Lys Met Lys 
35 40 45 



lie Leu Gly Thr lie Gin lie Leu Phe Gly lie Met Thr Phe Ser Phe 
50 55 60 



Gly Val lie Phe Leu Phe Thr Leu Leu Lys Pro Tyr Pro Arg Phe Pro 
65 70 75 80 



Phe lie Phe Leu Ser Gly Tyr Pro Phe Trp Gly Ser Val Leu Phe lie 
85 90 95 



Asn Ser Gly Ala Phe Leu lie Ala Val Lys Arg Lys Thr Thr Glu Thr 
100 105 110 



Leu lie lie Leu Ser Arg lie Met Asn Phe Leu Ser Ala Leu Gly Ala 
115 120 125 
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lie Ala Gly lie lie Leu Leu Thr Phe Gly Phe lie Leu Asp Gin Asn 
130 135 140 

Tyr lie Cys Gly Tyr Ser His Gin Asn Ser Gin Cys Lys Ala Val Thr 
145 150 155 160 



Val Leu Phe Leu Gly lie Leu lie Thr Leu Met Thr Phe Ser lie lie 

165 170 175 

Glu Leu Phe lie Ser Leu Pro Phe Ser lie Leu Gly Cys His Ser Glu 

180 185 190 



Asp Cys Asp Cys Glu Gin Cys Cys 
195 200 

<210> 7 

<211> 1152 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1152) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP , or dTTP) 
<220> 

<221> CDS 

<222> (239) . . (985) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP , or dTTP) 
<400> 7 

ttttactgac cttgctagaa gtttacagnc acggaagtgc aggaacattt cacaaatcta 60 

caatctgtga gtatcacatc ctgtatagct gtaaacactg gaataaggaa gggctgatga 120 

ctttcagaag atgaaggtaa gtagaaaccg ttgatgggac tgagaaacca gagttaaaac 180 

ctctttggag cttctgagga ctcagctgga accaacgggc acagttggca acaccatc 23 8 

atg aca tea caa cct gtt ccc aat gag acc ate ata gtg etc cca tea 286 
Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie lie Val Leu Pro Ser 
15 10 15 

aat gtc ate aac ttc tec caa gca gag aaa ccc gaa ccc acc aac cag 334 
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Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 

ggg cag gat age ctg aag aaa cat eta cac gca gaa att aaa gtt att 382 
Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu lie Lys Val He 
35 40 45 

ggg act ate cag ate ttg tgt ggc atg atg gta ttg age ttg ggg ate 430 
Gly Thr He Gin He Leu Cys Gly Met Met Val Leu Ser Leu Gly He 
50 55 60 

att ttg gca tct get tec ttc tct cca aat ttt ace caa gtg act tct 478 
He Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
65 70 75 80 

aca ctg ttg aac tct get tac cca ttc ata gga ccc ttt ttt ttt ate 526 
Thr Leu Leu Asn Ser Ala Tyr Pro Phe He Gly Pro Phe Phe Phe He 
85 90 95 

ate tct ggc tct eta tea ate gee aca gag aaa agg tta ace aag ctt 574 
He Ser Gly Ser Leu Ser He Ala Thr Glu Lys Arg Leu Thr Lys Leu 
100 105 110 

ttg gtg cat age age ctg gtt gga age att ctg agt get ctg tct gee 622 
Leu Val His Ser Ser Leu Val Gly Ser He Leu Ser Ala Leu Ser Ala 
115 120 125 

ctg gtg ggt ttc att ate ctg tct gtc aaa cag gee ace tta aat cct 670 
Leu Val Gly Phe He He Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 

gee tea ctg cag tgt gag ttg gac aaa aat aat ata cca aca aga agt 718 
Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn He Pro Thr Arg Ser 
145 150 155 160 

tat gtt tct tac ttt tat cat gat tea ctt tat ace acg gac tgc tat 766 
Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 
165 170 175 

aca gee aaa gee agt ctg get gga act etc tct ctg atg ctg att tgc 814 
Thr Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu He Cys 
180 185 190 

act ctg ctg gaa ttc tgc eta get gtg etc act get gtg ctg egg tgg 862 
Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 

aaa cag get tac tct gac ttc cct ggg agt gta ctt ttc ctg cct cac 910 
Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 

agt tac att ggt aat tct ggc atg tec tea aaa atg act cat gac tgt 958 
Ser Tyr He Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 240 

gga tat gaa gaa eta ttg act tct taa gaaaaaaggg agaaatatta 1005 
Gly Tyr Glu Glu Leu Leu Thr Ser 
245 

atcagaaagt tgattcttat gataatatgg aaaagttaac cattatagaa aagcaaagct 1065 
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tgagtttcct aaatgtaagc ttttaaagta atgaacatta aaaaaaacca ttatttcact 1125 
gtcaaaaaaa aaaaaaaaaa aaaaaaa 1152 

<210> 8 

<211> 248 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie lie Val Leu Pro Ser 
15 10 15 



Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 



Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu lie Lys Val lie 
35 40 45 



Gly Thr He Gin He Leu Cys Gly Met Met Val Leu Ser Leu Gly He 
50 55 60 



He Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
65 70 75 80 



Thr Leu Leu Asn Ser Ala Tyr Pro Phe He Gly Pro Phe Phe Phe He 
85 90 95 



He Ser Gly Ser Leu Ser He Ala Thr Glu Lys Arg Leu Thr Lys Leu 
100 105 110 



Leu Val His Ser Ser Leu Val Gly Ser He Leu Ser Ala Leu Ser Ala 
115 120 125 



Leu Val Gly Phe He He Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 



Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn He Pro Thr Arg Ser 
145 150 155 160 



Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 
165 170 175 
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Thr Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu lie Cys 
180 185 190 



Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 



Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 



Ser Tyr lie Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 240 



Gly Tyr Glu Glu Leu Leu Thr Ser 
245 



<210> 9 

<211> 930 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDNA 

<222> (1) . . (930) 

<220> 

<221> CDS 

<222> (17) . . (763) 



<400> 9 

agttggcaac accatc atg aca tea caa cct gtt ccc aat gag acc ate ata 52 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie lie 
15 10 

gtg etc cca tea aat gtc ate aac ttc tec caa gca gag aaa ccc gaa 100 
Val Leu Pro Ser Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu 
15 20 25 

ccc acc aac cag ggg cag gat age ctg aag aaa cat eta cac gca gaa 148 
Pro Thr Asn Gin Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu 
30 35 40 

ate aaa gtt att ggg act ate cag ate ttg tgt ggc atg atg gta ttg 196 
He Lys Val He Gly Thr He Gin He Leu Cys Gly Met Met Val Leu 



-12- 



WO 02/062946 



PCT/US01/48437 



45 50 55 60 

age ttg ggg ate att ttg gca tct get tec ttc tct cca aat ttt ace 244 
Ser Leu Gly lie lie Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr 
65 70 75 

caa gtg act tct aca ctg ttg aac tct get tac cca ttc ata gga ccc 292 
Gin Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr Pro Phe lie Gly Pro 
80 85 90 

ttt ttt ttt ate ate tct ggc tct eta tea ate gee aca gag aaa agg 340 
Phe Phe Phe lie lie Ser Gly Ser Leu Ser lie Ala Thr Glu Lys Arg 
95 100 105 

tta acc aag ctt ttg gtg cat age age ctg gtt gga age att ctg agt 388 
Leu Thr Lys Leu Leu Val His Ser Ser Leu Val Gly Ser lie Leu Ser 
110 115 120 

get ctg tct gee ctg gtg ggt ttc att ate ctg tct gtc aaa cag gee 436 
Ala Leu Ser Ala Leu Val Gly Phe lie lie Leu Ser Val Lys Gin Ala 
125 130 135 140 

acc tta aat cct gec tea ctg cag tgt gag ttg gac aaa aat aat ata 484 
Thr Leu Asn Pro Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn lie 
145 150 155 

cca aca aga agt tat gtt tct tac ttt tat cat gat tea ctt tat acc 532 
Pro Thr Arg Ser Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr 
160 165 170 



acg gac tgc tat aca gec aaa gec agt ctg get gga tct etc tct ctg 580 
Thr Asp Cys Tyr Thr Ala Lys Ala Ser Leu Ala Gly Ser Leu Ser Leu 
175 180 185 

atg ctg att tgc act ctg ctg gaa ttc tgc eta get gtg etc act get 628 
Met Leu lie Cys Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala 
190 195 200 

gtg ctg egg tgg aaa cag get tac tct gac ttc cct ggg agt gta ctt 676 
Val Leu Arg Trp Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu 
205 210 215 220 

ttc ctg cct cac agt tac att ggt aat tct ggc atg tec tea aaa atg 724 
Phe Leu Pro His Ser Tyr lie Gly Asn Ser Gly Met Ser Ser Lys Met 
225 230 235 

act cat gac tgt gga tat gaa gaa eta ttg act tct taa gaaaaaaggg 773 
Thr His Asp Cys Gly Tyr Glu Glu Leu Leu Thr Ser 
240 245 

agaaatatta atcagaaagt tgattcttat gataatatgg aaaagttaac cattatagaa 833 

aagcaaagct tgagtttcct aaatgtaagc ttttaaagta atgaacatta aaaaaaacca 893 

ttatttcact gtcaaaaaaa aaaaaaaaaa aaaaaaa 93 0 

<210> 10 
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<211> 248 
<212> PRT 
<213> Homo sapiens 

<400> 10 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie lie Val Leu Pro Ser 
15 10 15 



Asn Val He Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 



Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu He Lys Val He 
35 40 45 



Gly Thr He Gin He Leu Cys Gly Met Met Val Leu Ser Leu Gly He 
50 55 60 



He Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
65 70 75 80 



Thr Leu Leu Asn Ser Ala Tyr Pro Phe He Gly Pro Phe Phe Phe He 
85 90 95 



He Ser Gly Ser Leu Ser He Ala Thr Glu Lys Arg Leu Thr Lys Leu 
100 105 110 



Leu Val His Ser Ser Leu Val Gly Ser He Leu Ser Ala Leu Ser Ala 
115 120 125 



Leu Val Gly Phe He He Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 



Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn He Pro Thr Arg Ser 
145 150 155 160 



Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 
165 170 175 



Thr Ala Lys Ala Ser Leu Ala Gly Ser Leu Ser Leu Met Leu He Cys 
180 185 190 



Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 



-14- 



WO 02/062946 



PCT/US01/48437 



Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 

Ser Tyr lie Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 240 

Gly Tyr Glu Glu Leu Leu Thr . Ser 
' 245 

<210> 11 

<211> 661 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (15) . . (458) 



<400> 11 

ttggcaacac catt atg aca tea caa cct att tec aat gag ace ate ata 50 

Met Thr Ser Gin Pro lie Ser Asn Glu Thr lie lie 
15 10 

atg etc cca tea aat gtc ate aac ttc tec caa gca gag aaa ccc gaa 98 

Met Leu Pro Ser Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu 
15 20 25 

ccc acc aac cag ggg cag gat age ctg aag aaa cgt eta cag gca aaa 146 

Pro Thr Asn Gin Gly Gin Asp Ser Leu Lys Lys Arg Leu Gin Ala Lys 

30 35 40 



gtc aaa gtt att ggg gtg cat age age ctg get gga age att ctg agt 194 
Val Lys Val lie Gly Val His Ser Ser Leu Ala Gly Ser He Leu Ser 
45 50 55 60 

get ctg tct gee ctg gtg ggt ttc att etc ctg tct gtc aac ccg get 242 
Ala Leu Ser Ala Leu Val Gly Phe He Leu Leu Ser Val Asn Pro Ala 
65 70 75 

gca tta aat cct gee tea ttg cag tgt aag ttg gac gaa aag gat ata 290 
Ala Leu Asn Pro Ala Ser Leu Gin Cys Lys Leu Asp Glu Lys Asp He 
80 85 90 

cca acc aga ctt ctt ctt tct tat gat tat cat tea cct tac acc atg 338 
Pro Thr Arg Leu Leu Leu Ser Tyr Asp Tyr His Ser Pro Tyr Thr Met 
95 100 105 
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gac tgc cat aga gcc aaa gcc agt ctg get gga act ctg tct ctg atg 3 86 

Asp Cys His Arg Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met 
110 115 120 

ctg gtt tct act gtg ttg gag ttc tgc eta get gtg etc act get gtg 434 
Leu Val Ser Thr Val Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val 
125 130 135 140 

ctg cag tgg aaa cag act gtc tga cttccctggt tactcatgtc ctcaaagacg 488 
Leu Gin Trp Lys Gin Thr Val 
145 

actcatgatg ctggatatga agaactattg acttcttggg aaaaaacgga gaaatattaa 548 

ttggaaagtt gatttttatg ataatatgga aaacctaacc attataaaaa agcaaacttg 608 

agtttcctaa atgtaagcat ttaaagtaaa tgcatatttg ttttaaaaaa tta 661 

<210> 12 

<211> 147 

<212> PRT 

<213> Homo sapiens 



<400> 12 

Met Thr Ser Gin Pro lie Ser Asn Glu Thr lie lie Met Leu Pro Ser 
15 10 15 



Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 



Gly Gin Asp Ser Leu Lys Lys Arg Leu Gin Ala Lys Val Lys Val lie 
35 40 45 



Gly Val His Ser Ser Leu Ala Gly Ser lie Leu Ser Ala Leu Ser Ala 
50 55 60 



Leu Val Gly Phe lie Leu Leu Ser Val Asn Pro Ala Ala Leu Asn Pro 
65 70 75 80 



Ala Ser Leu Gin Cys Lys Leu Asp Glu Lys Asp lie Pro Thr Arg Leu 
85 90 95 



Leu Leu Ser Tyr Asp Tyr His Ser Pro Tyr Thr Met Asp Cys His Arg 
100 105 " 110 



Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu Val Ser Thr 
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115 120 125 



Val Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Gin Trp Lys 
130 135 140 



Gin Thr Val 
145 



<210> 13 

<211> 1228 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDMA 

<222> (1) . . (1228) 

<220> 

<221> CDS 

<222> (118) . . (840) 



<400> 13 

agattgttct ggcaaggaac agccagtggg aggttccagc tgagcgctcc ccagaggtga 60 

gctgatcccc agccacagca cacaggacca ggctgcgaga acagcatcat cagcatc 117 

atg eta tta caa tec caa acc atg ggg gtt tct cac age ttt aca cca 165 
Met Leu Leu Gin Ser Gin Thr Met Gly Val Ser His Ser Phe Thr Pro 
15 10 15 

aag ggc ate act ate cct caa aga gag aaa cct gga cac atg tac caa 213 
Lys Gly lie Thr lie Pro Gin Arg Glu Lys Pro Gly His Met Tyr Gin 
20 25 30 

aac gaa gat tac ctg cag aac ggg ctg cca aca gaa acc acc gtt ctt 261 
Asn Glu Asp Tyr Leu Gin Asn Gly Leu Pro Thr Glu Thr Thr Val Leu 
35 40 45 

ggg act gtc cag ate ctg tgt tgc ctg ttg att tea agt ctg ggg gec 309 
Gly Thr Val Gin lie Leu Cys Cys Leu Leu He Ser Ser Leu Gly Ala 
50 55 60 

ate ttg gtt ttt get ccc tac ccc tec cac ttc aat cca gca att tec 357 
He Leu Val Phe Ala Pro Tyr Pro Ser His Phe Asn Pro Ala He Ser 
65 70 75 80 
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acc act ttg atg tct ggg tac cca ttt tta gga get ctg tgt ttt ggc 405 
Thr Thr Leu Met Ser Gly Tyr Pro Phe Leu Gly Ala Leu Cys Phe Gly 
85 90 95 

att act gga tec etc tea att ate tct gga aaa caa tea act aag ccc 453 
lie Thr Gly Ser Leu Ser lie lie Ser Gly Lys Gin Ser Thr Lys Pro 
100 105 110 

ttt gac ctg age age ttg acc tea aat gca gtg agt tct gtt act gca 501 
Phe Asp Leu Ser Ser Leu Thr Ser Asn Ala Val Ser Ser Val Thr Ala 
115 120 125 

gga gca ggc etc ttc etc ctt get gac age atg gta gee ctg agg act 549 
Gly Ala Gly Leu Phe Leu Leu Ala Asp Ser Met Val Ala Leu Arg Thr 
130 135 140 

gec tct caa cat tgt ggc tea gaa atg gat tat eta tec tea ttg cct 597 
Ala Ser Gin His Cys Gly Ser Glu Met Asp Tyr Leu Ser Ser Leu Pro 
145 150 155 160 

tat teg gag tac tat tat cca ata tat gaa ate aaa gat tgt etc ctg 645 
Tyr Ser Glu Tyr Tyr Tyr Pro lie Tyr Glu lie Lys Asp Cys Leu Leu 
165 170 175 

acc agt gtc agt tta aca ggt gtc eta gtg gtg atg etc ate ttc act 693 
Thr Ser Val Ser Leu Thr Gly Val Leu Val Val Met Leu lie Phe Thr 
180 185 190 

gtg ctg gag etc tta tta get gca tac agt tct gtc ttt tgg tgg aaa 741 
Val Leu Glu Leu Leu Leu Ala Ala Tyr Ser Ser Val Phe Trp Trp Lys 
195 200 205 

cag etc tac tec aac aac cct ggg agt tea ttt tec teg acc cag tea 789 
Gin Leu Tyr Ser Asn Asn Pro Gly Ser Ser Phe Ser Ser Thr Gin Ser 
210 215 220 

caa gat cat ate caa cag gtc aaa aag agt tct tea egg tct tgg ata 837 
Gin Asp His lie Gin Gin Val Lys Lys Ser Ser Ser Arg Ser Trp lie 
225 230 235 240 

taa gtaactcttg gectcagagg aaggaaaagc aactcaacac tcatggtcaa 890 

gtgtgattag actttcctga aatctctgcc attttagata ctgtgaaaca aactaaaaaa 950 

aaaaaagctt ttgttttgta tttgtttact atgagtcgtt atttaatttc tcttgaaaat 1010 

aatttcctca aagcccaagt caataaatgt tatcagecag tcttccaaaa tggtcataaa 1070 

ctttataaac tgctttgggt aaactgagca gaaggtgata cacagaaggg aaaatgtgca 113 0 

etcatgetag tgtgaatttg gtaagtcgcg tgactctgea ggctgtttct gtattatttt 1190 

cacactcata ttgcttaaat attacatatt agggattg 1228 



<210> 14 
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<211> 240 
<212> PRT 
<213> Homo sapiens 

<400> 14 

Met Leu Leu Gin Ser Gin Thr Met Gly Val Ser His Ser Phe Thr Pro 
15 10 15 



Lys Gly lie Thr lie Pro Gin Arg Glu Lys Pro Gly His Met Tyr Gin 
20 25 30 



Asn Glu Asp Tyr Leu Gin Asn Gly Leu Pro Thr Glu Thr Thr Val Leu 
35 40 45 



Gly Thr Val Gin lie Leu Cys Cys Leu Leu lie Ser Ser Leu Gly Ala 
50 55 60 



lie Leu Val Phe Ala Pro Tyr Pro Ser His Phe Asn Pro Ala lie Ser 
65 70 75 80 



Thr Thr Leu Met Ser Gly Tyr Pro Phe Leu Gly Ala Leu Cys Phe Gly 
85 * 90 95 



lie Thr Gly Ser Leu Ser lie lie Ser Gly Lys Gin Ser Thr Lys Pro 
100 105 110 



Phe Asp Leu Ser Ser Leu Thr Ser Asn Ala Val Ser Ser Val Thr Ala 
115 120 125 



Gly Ala Gly Leu Phe Leu Leu Ala Asp Ser Met Val Ala Leu Arg Thr 
130 135 140 



Ala Ser Gin His Cys Gly Ser Glu Met Asp Tyr Leu Ser Ser Leu Pro 
145 150 155 160 



Tyr Ser Glu Tyr Tyr Tyr Pro lie Tyr Glu lie Lys Asp Cys Leu Leu 
165 170 175 



Thr Ser Val Ser Leu Thr Gly Val Leu Val Val Met Leu lie Phe Thr 
180 185 ' 190 



Val Leu Glu Leu Leu Leu Ala Ala Tyr Ser Ser Val Phe Trp Trp Lys 
195 200 205 
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Gin Leu Tyr Ser Asn Asn Pro Gly Ser Ser Phe Ser Ser Thr Gin Ser 
210 215 220 

Gin Asp His lie Gin Gin Val Lys Lys Ser Ser Ser Arg Ser Trp lie 
225 230 235 240 

<210> 15 

<211> 1369 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDNA 

<222> (1) . . (1369) 

<220> 

<221> CDS 

<222> (204) . . (956) 



<400> 15 

aaacaggaaa taaatacgaa tgaaactgag ctctaagcag catgtaacct ggcctgcatc 60 

caggaaatag aggacttcgg atccttctaa ccctaccacc caactggccc cagtacattc 12 0 

attctctcag gaaaaaaaac aaggtcccca cagcaaagaa aaggaatagg atcaagagat 180 

acgtggctgc tggcagagca age atg aat teg atg act tea gca gtt ccg gtg 23 3 

Met Asn Ser Met Thr Ser Ala Val Pro Val 
15 10 

gec aat tct gtg ttg gtg gtg gca ccc cac aat ggt tat cct gtg acc 281 
Ala Asn Ser Val Leu Val Val Ala Pro His Asn Gly Tyr Pro Val Thr 
15 20 25 

cca gga att atg tct cac gtg ccc ctg tat cca aac age cag ccg caa 329 
Pro Gly lie Met Ser His Val Pro Leu Tyr Pro Asn Ser Gin Pro Gin 
30 35 40 

gtc cac eta gtt cct ggg aac cca cct agt ttg gtg teg aat gtg aat 377 
Val His Leu Val Pro Gly Asn Pro Pro Ser Leu Val Ser Asn Val Asn 
45 50 55 

ggg cag cct gtg cag aaa get ctg aaa gaa ggc aaa acc ttg ggg gee 42 5 

Gly Gin Pro Val Gin Lys Ala Leu Lys Glu Gly Lys Thr Leu Gly Ala 
60 65 70 
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ate cag ate 
lie Gin lie 
75 

gcg acg gtt 
Ala Thr Val 



ttt ccc ttc 
Phe Pro Phe 



gtg gca gca 
Val Ala Ala 
125 

ttg ggc ttg 
Leu Gly Leu 
140 



ate att ggc 
He He Gly 
80 

etc gta ggg 
Leu Val Gly 
95 

tgg gga ggc 
Trp Gly Gly 
110 

gaa aat cag 
Glu Asn Gin 



aac ate gtc 
Asn He Val 



ctg get cac 
Leu Ala His 



gaa tac ctg 
Glu Tyr Leu 



ttg tgg ttt 
Leu Trp Phe 
115 

cca tat tct 
Pro Tyr Ser 
130 

agt gca ate 
Ser Ala He 
145 



ate ggc etc 
He Gly Leu 
85 

tct att tea 
Ser He Ser 
100 

ate att tea 
He He Ser 



tat tgc ctg 
Tyr Cys Leu 



tgc tct gca 
Cys Ser Ala 
150 



ggc tec ate 
Gly Ser He 



ttc tac gga 
Phe Tyr Gly 
105 

gga tct etc 
Gly Ser Leu 
120 

ctg tct ggc 
Leu Ser Gly 
135 

gtt gga gtc 
Val Gly Val 



atg 473 

Met 

90 

ggc 521 
Gly 



tec 569 
Ser 



agt 617 
Ser 



ata 665 
He 



etc ttc ate aca gat eta agt att ccc cac cca tat gec tac ccc gac 713 
Leu Phe He Thr Asp Leu Ser He Pro His Pro Tyr Ala Tyr Pro Asp 
155 160 165 170 

tat tat cct tac gec tgg ggt gtg aac cct gga atg gcg att tct ggc 761 
Tyr Tyr Pro Tyr Ala Trp Gly Val Asn Pro Gly Met Ala He Ser Gly 
175 180 185 

gtg ctg ctg gtc ttc tgc etc ctg gag ttt ggc ate gca tgc gca tct 809 
Val Leu Leu Val Phe Cys Leu Leu Glu Phe Gly He Ala Cys Ala Ser 
190 195 200 

tec cac ttt ggc tgc cag ttg gtc tgc tgt caa tea age aat gtg agt 857 
Ser His Phe Gly Cys Gin Leu Val Cys Cys Gin Ser Ser Asn Val Ser 
205 210 * 215 

gtc ate tat cca aac ate tat gca gca aac cca gtg ate acc cca gaa 905 
Val He Tyr Pro Asn He Tyr Ala Ala Asn Pro Val He Thr Pro Glu 
220 225 230 

ccg gtg acc tea cca cca agt tat tec agt gag ate caa gca aat aag 953 
Pro Val Thr Ser Pro Pro Ser Tyr Ser Ser Glu He Gin Ala Asn Lys 
235 240 245 250 

taa ggctacagat tctggaagca tctttcactg ggaccaaaag aagtcctcct 1006 

ccctttctgg gcttccataa cccaggtcgt tcctgttctg acagctgagg aaaegtctet 1066 

cccactgttt gtactctcac cttcattctt caattcagtc taggaaacca tgctgtttct, 1126 

ctatcaagaa gaagacagag attttaaaca gatgttaacc aagagggact ccctagggca 1186 

catgeatcag cacatatgtg ggcatccagc ctctggggcc ttggcacaca cacattegtg 1246 

tgctctgctg catgtgagct tgtgggttag aggaacaaat atctagacat tcaatcttca 1306 

ctctttcaat tgtgcattca tttaataaat agatactgag cattcaaaaa aaaaaaaaaa 1366 

aaa 1369 
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<210> 16 

<211> 250 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Asn Ser Met Thr Ser Ala Val Pro Val Ala Asn Ser Val Leu Val 
15 10 15 



Val Ala Pro His Asn Gly Tyr Pro Val Thr Pro Gly lie Met Ser His 
20 25 30 



Val Pro Leu Tyr Pro Asn Ser Gin Pro Gin Val His Leu Val Pro Gly 
35 40 45 



Asn Pro Pro Ser Leu Val Ser Asn Val Asn Gly Gin Pro Val Gin Lys 
50 55 60 



Ala Leu Lys Glu Gly Lys Thr Leu Gly Ala lie Gin lie lie lie Gly 
65 70 75 80 



Leu Ala His lie Gly Leu Gly Ser lie Met Ala Thr Val Leu Val Gly 
85 90 95 



Glu Tyr Leu Ser lie Ser Phe Tyr Gly Gly Phe Pro Phe Trp Gly Gly 
100 105 110 



Leu Trp Phe lie lie Ser Gly Ser Leu Ser Val Ala Ala Glu Asn Gin 
115 120 125 



Pro Tyr Ser Tyr Cys Leu Leu Ser Gly Ser Leu Gly Leu Asn lie Val 
130 135 140 



Ser Ala He Cys Ser Ala Val Gly Val He Leu Phe He Thr Asp Leu 
145 150 155 160 



Ser He Pro His Pro Tyr Ala Tyr Pro Asp Tyr Tyr Pro Tyr Ala Trp 
165 170 175 



Gly Val Asn Pro Gly Met Ala He Ser Gly Val Leu Leu Val Phe Cys 
180 185 190 
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Leu Leu Glu Phe Gly lie Ala Cys Ala Ser Ser His Phe Gly Cys Gin 
195 200 205 



Leu Val Cys Cys Gin Ser Ser Asn Val Ser Val lie Tyr Pro Asn lie 
210 215 220 



Tyr Ala Ala Asn Pro Val lie Thr Pro Glu Pro Val Thr Ser Pro Pro 
225 230 235 240 



Ser Tyr Ser Ser Glu lie Gin Ala Asn Lys 
245 250 



<210> 17 

<211> 726 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (726) 



<400> 17 

atg aaa gca gaa gcc aca gtt att ccc age cgt tgt get agg ggg etc 48 
Met Lys Ala Glu Ala Thr Val lie Pro Ser Arg Cys Ala Arg Gly Leu 
15 10 15 

cca tea tgg caa gtc etc age cca gtc cag ccc tgg cag aca agt gca 96 
Pro Ser Trp Gin Val Leu Ser Pro Val Gin Pro Trp Gin Thr Ser Ala 
20 25 30 

ccc cag aac acg acc cag ccc aag etc ctg get cca cac cag cac gag 144 
Pro Gin Asn Thr Thr Gin Pro Lys Leu Leu Ala Pro His Gin His Glu 
35 40 45 

aag tec cag aag aag age age ctt ctt aag gag ctg ggg gcc ttc cac 192 
Lys Ser Gin Lys Lys Ser Ser Leu Leu Lys Glu Leu Gly Ala Phe His 
50 55 60 

ate acc ate get ctg ctg cac ctg gtc ttt ggg ggc tac ctg gcc tct 240 
lie Thr lie Ala Leu Leu His Leu Val Phe Gly Gly Tyr Leu Ala Ser 
65 70 75 80 

ata gtc aag aac ctt cac ctg gtg gtg ctg aag tct tgg tat cca ttc 288 
lie Val Lys Asn Leu His Leu Val Val Leu Lys Ser Trp Tyr Pro Phe 
85 90 95 
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tgg ggg get gec tct ttt etc att tea ggg ate ttg gcg ata aca atg 33 6 

Trp Gly Ala Ala Ser Phe Leu lie Ser Gly lie Leu Ala lie Thr Met 
100 105 110 

aag ace ttt tct aaa act tac ctg aag atg ttg tgc ctg atg aca aac 384 
Lys Thr Phe Ser Lys Thr Tyr Leu Lys Met Leu Cys Leu Met Thr Asn 
115 120 125 

etc ate age etc ttt tgc gtg ctg tct ggc etc ttc gtc ate tec aag 432 
Leu lie Ser Leu Phe Cys Val Leu Ser Gly Leu Phe Val lie Ser Lys 
130 135 140 

gat etc ttt ctg gag age cca ttt gag tec ccg ate tgg aga atg tac 480 
Asp Leu Phe Leu Glu Ser Pro Phe Glu Ser Pro lie Trp Arg Met Tyr 
145 150 155 160 

ccc aac tec acg gtc cac ate cag agg ctg gag ctg gec ttg etc tgc 52 8 

Pro Asn Ser Thr Val His lie Gin Arg Leu Glu Leu Ala Leu Leu Cys 
165 170 175 

ttc act gtc eta gag etc ttc ctg cca gtg ccc aca get gtc aca gee 576 
Phe Thr Val Leu Glu Leu Phe Leu Pro Val Pro Thr Ala Val Thr Ala 
180 185 190 

tgg aga ggg gac tgc cca tct gca aag aat gat gat gca tgc ctt gtt 624 
Trp Arg Gly Asp Cys Pro Ser Ala Lys Asn Asp Asp Ala Cys Leu Val 
195 200 205 

ccg aat aca cca ttg cat etc aaa ggc ctg ccg gtg gag ccc ccg cca 672 
Pro Asn Thr Pro Leu His Leu Lys Gly Leu Pro Val Glu Pro Pro Pro 
210 215 220 

tec tac cag agt gtg att caa ggc gac gca caa cac aag caa cat cag 720 
Ser Tyr Gin Ser Val He Gin Gly Asp Ala Gin His Lys Gin His Gin 
225 230 235 240 

agg tga 72 6 

Arg 



<210> 18 

<211> 241 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Met Lys Ala Glu Ala Thr Val He Pro Ser Arg Cys Ala Arg Gly Leu 
15 10 15 



Pro Ser Trp Gin Val Leu Ser Pro Val Gin Pro Trp Gin Thr Ser Ala 
20 25 30 



Pro Gin Asn Thr Thr Gin Pro Lys Leu Leu Ala Pro His Gin His Glu 
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35 40 45 



Lys Ser Gin Lys Lys Ser Ser Leu Leu Lys Glu Leu Gly Ala Phe His 
50 55 60 



lie Thr lie Ala Leu Leu His Leu Val Phe Gly Gly Tyr Leu Ala Ser 
65 70 75 80 



lie Val Lys Asn Leu His Leu Val Val Leu Lys Ser Trp Tyr Pro Phe 
85 90 95 



Trp Gly Ala Ala Ser Phe Leu lie Ser Gly lie Leu Ala lie Thr Met 
100 105 110 



Lys Thr Phe Ser Lys Thr Tyr Leu Lys Met Leu Cys Leu Met Thr Asn 
115 120 125 



Leu lie Ser Leu Phe Cys Val Leu Ser Gly Leu Phe Val lie Ser Lys 
130 135 140 



Asp Leu Phe Leu Glu Ser Pro Phe Glu Ser Pro lie Trp Arg Met Tyr 
145 150 155 160 



Pro Asn Ser Thr Val His lie Gin Arg Leu Glu Leu Ala Leu Leu Cys 
165 170 175 



Phe Thr Val Leu Glu Leu Phe Leu Pro Val Pro Thr Ala Val Thr Ala 
180 185 190 



Trp Arg Gly Asp Cys Pro Ser Ala Lys Asn Asp Asp Ala Cys Leu Val 
195 200 205 



Pro Asn Thr Pro Leu His Leu Lys Gly Leu Pro Val Glu Pro Pro Pro 
210 215 220 



Ser Tyr Gin Ser Val lie Gin Gly Asp Ala Gin His Lys Gin His Gin 
225 230 235 240 



Arg 

<210> 19 

<211> 518 

<212> DNA 
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<213> Mus musculus 
<220> 

<221> cDNA 

<222> (1) . . (518) 

<220> 

<221> CDS 

<222> (148) . . (516) 



<400> 19 

taagcgctga ggaggaagag actgctggtt ttgggggaca gactctggtg gtcattactg 60 

tctcctcttc tgtaatgagt tggacttgca ggggaaggac ttgtcaaatc ctgaattctt 12 0 

caaggtggta cctcatcagg cagacac atg aag cca gag gag act ggt ggt tct 174 

Met Lys Pro Glu Glu Thr Gly Gly Ser 
1 5 

gtt tat cag ccc ttg gat gag tea cgc cat gtt caa cga ggt gta ctg 222 
Val Tyr Gin Pro Leu Asp Glu Ser Arg His Val Gin Arg Gly Val Leu 
10 15 20 25 

caa gec etc ggg gec ate cag ate ctg aat gga ata ctg att eta get 270 
Gin Ala Leu Gly Ala lie Gin lie Leu Asn Gly lie Leu lie Leu Ala 
30 35 40 

etc gga att ttt ctg gtt tgt tta caa cac gtg tec cac cac ttc agg 318 
Leu Gly lie Phe Leu Val Cys Leu Gin His Val Ser His His Phe Arg 
45 50 55 

cat ttc ttc ttc ttc ace ttc tac aca ggc tac cca ctg tgg ggt get 366 
His Phe Phe Phe Phe Thr Phe Tyr Thr Gly Tyr Pro Leu Trp Gly Ala 
60 65 70 



gtg ttt ttt ate age tea gga tec ttg act gtt gee gca ggg aga aac 414 

Val Phe Phe lie Ser Ser Gly Ser Leu Thr Val Ala Ala Gly Arg Asn 
75 80 85 

ccc aca cga atg ctg atg caa aac agt ttt ggg ata aac att gee agt 462 

Pro Thr Arg Met Leu Met Gin Asn Ser Phe Gly lie Asn lie Ala Ser 
90 95 100 105 



act aca att gca ttt gtt ggg act gtt ttc ctt tct gtg cat ttg gca 510 
Thr Thr He Ala Phe Val Gly Thr Val Phe Leu Ser Val His Leu Ala 
110 115 120 

ttc aat ac 518 
Phe Asn 
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<210> 20 

<211> 123 

<212> PRT 

<213> Mus musculus 

<400> 20 

Met Lys Pro Glu Glu Thr Gly Gly Ser Val Tyr Gin Pro Leu Asp Glu 
15 10 15 



Ser Arg His Val Gin Arg Gly Val Leu Gin Ala Leu Gly Ala lie Gin 
20 25 30 



lie Leu Asn Gly He Leu He Leu Ala Leu Gly He Phe Leu Val Cys 
35 40 45 



Leu Gin His Val Ser His His Phe Arg His Phe Phe Phe Phe Thr Phe 
50 55 60 



Tyr Thr Gly Tyr Pro Leu Trp Gly Ala Val Phe Phe He Ser Ser Gly 
65 70 75 80 



Ser Leu Thr Val Ala Ala Gly Arg Asn Pro Thr Arg Met Leu Met Gin 
85 90 95 



Asn Ser Phe Gly He Asn He Ala Ser Thr Thr He Ala Phe Val Gly 
100 105 110 



Thr Val Phe Leu Ser Val His Leu Ala Phe Asn 
115 120 



<210> 21 

<211> 1228 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDNA 
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<222> (1) . . (1228) 
<220> 

<221> CDS 

<222> (98) . . (778) 



<400> 21 

ttcggcacga ggcacacaag caaagcttct cctggaataa atcagctgac atctgatcct 60 

ttccattgtc tgtactgttt ctgctgctct ggaaacc atg caa gga cag gaa cag 115 

Met Gin Gly Gin Glu Gin 
1 5 

acc acc atg gca gtg gtt cct gga gtt get gtg cct tea aag aat tct 163 
Thr Thr Met Ala Val Val Pro Gly Val Ala Val Pro Ser Lys Asn Ser 
10 15 20 

gtt atg aca tea caa atg tgg aat gag aag aaa gag aaa ttc ttg aag 211 
Val Met Thr Ser Gin Met Trp Asn Glu Lys Lys Glu Lys Phe Leu Lys 
25 30 35 

999 9^a ccc aaa gtc ctt ggg gtt tta caa gtg atg att get ate ata 259 
Gly Glu Pro Lys Val Leu Gly Val Leu Gin Val Met lie Ala lie He 
40 45 50 

aac etc age tta gga ata ata att ttg aca act tta ttt tct gaa eta 307 
Asn Leu Ser Leu Gly He He He Leu Thr Thr Leu Phe Ser Glu Leu 
55 60 65 70 

ccc act tea gtg atg tta atg gtc cca att tgg gga tea ata atg ttc 355 
Pro Thr Ser Val Met Leu Met Val Pro He Trp Gly Ser He Met Phe 
75 80 85 

att gtc tec gga tec ctg tec att gca gca gga gtg aca cct aca aaa 403 
He Val Ser- Gly Ser Leu Ser He Ala Ala Gly Val Thr Pro Thr Lys 
90 95 100 

tgc ctg ate gtt gee agt eta act ctg aac act ate acc tct gtg ttg 451 
Cys Leu He Val Ala Ser Leu Thr Leu Asn Thr He Thr Ser Val Leu 
105 110 115 

get gca act gca age ata atg ggt gta gtc agt gtg get gtg ggt tea 499 
Ala Ala Thr Ala Ser He Met Gly Val Val Ser Val Ala Val Gly Ser 
120 125 130 

cag ttt ccg ttt egg tat aat tat aca ate acc aag ggt ttg gat gtt 547 
Gin Phe Pro Phe Arg Tyr Asn Tyr Thr He Thr Lys Gly Leu Asp Val 
135 140 145 150 

ttg atg tta att ttc aat atg eta gaa ttc tgc ctt get gtg tec gtc 595 
Leu Met Leu He Phe Asn Met Leu Glu Phe Cys Leu Ala Val Ser Val 
155 160 165 

tct gca ttt gga tgt gaa get tec tgt tgt aac tec cgt gag gtt ctt 643 
Ser Ala Phe Gly Cys Glu Ala Ser Cys Cys Asn Ser Arg Glu Val Leu 
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170 175 180 

gta gtg eta cca tea aat cct gtt gag aca gtg atg gca ccc cca atg 691 
Val Val Leu Pro Ser Asn Pro Val Glu Thr Val Met Ala Pro Pro Met 
185 190 195 

aca ctt caa cca ttg eta cca tea gaa cac caa ggg ace aat gtt cca 739 
Thr Leu Gin Pro Leu Leu Pro Ser Glu His Gin Gly Thr Asn Val Pro 
200 205 210 

gga aat gtg tac aag aac cac cca gga gaa ata gtc taa ttttgatgtg 78 8 
Gly Asn Val Tyr Lys Asn His Pro Gly Glu lie Val 
215 220 225 

tgtgtgtgtg tatttcccta ggatattaac acttcattgc actggctttt gaggtgaata 848 

ttagatttac tgtaagtatg taagtcaagc acttattagg tcaacaacac ttcaacatat 908 

tatattcatt gtatgtacaa ggggcaatga atttgeaaag atgttttgaa agcaaacaga 968 

aaaaaaaaaa aaacaaccaa acaaaagacc tcttagtgaa atgaggtctc tttgeaaaga 102 8 

ctaaaaaact ggagttcaca tttttggggg tggggggggc ttttgctaaa taagtagatt 1088 

tagatgettt tgatcaagta caaactcata aagtatgtaa gaaattacta atgataggaa 1148 

ccaattttaa tctcatatgt aacatagtgt ataatttaat agatctggta aaaatttata 1208 

ataaagagaa atgcctgaaa 122 8 

<210> 22 

<211> 226 

<212> PRT 

<213> Mus musculus 



<400> 22 

Met Gin Gly Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Val Ala 
15 10 15 



Val Pro Ser Lys Asn Ser Val Met Thr Ser Gin Met Trp Asn Glu Lys 
20 25 30 



Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Leu Gin 
35 40 45 



Val Met lie Ala lie He Asn Leu Ser Leu Gly He He He Leu Thr 
50 55 60 



Thr Leu Phe Ser Glu Leu Pro Thr Ser Val Met Leu Met Val Pro He 
65 70 75 80 
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Trp Gly Ser lie Met Phe lie Val Ser Gly Ser Leu Ser lie Ala Ala 
85 90 95 



Gly Val Thr Pro Thr Lys Cys Leu lie Val Ala Ser Leu Thr Leu Asn 
100 105 110 



Thr He Thr Ser Val Leu Ala Ala Thr Ala Ser He Met Gly Val Val 
115 120 125 



Ser Val Ala Val Gly Ser Gin Phe Pro Phe Arg Tyr Asn Tyr Thr He 
130 ' 135 140 



Thr Lys Gly Leu Asp Val Leu Met Leu He Phe Asn Met Leu Glu Phe 
145 150 155 160 



Cys Leu Ala Val Ser Val Ser Ala Phe Gly Cys Glu Ala Ser Cys Cys 
165 170 175 



Asn Ser Arg Glu Val Leu Val Val Leu Pro Ser Asn Pro Val Glu Thr 
180 185 190 



Val Met Ala Pro Pro Met Thr Leu Gin Pro Leu Leu Pro Ser Glu His 
195 200 205 



Gin Gly Thr Asn Val Pro Gly Asn Val Tyr Lys Asn His Pro Gly Glu 
210 215 220 



He Val 
225 



<210> 23 

<211> 1480 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDNA 

<222> (1) . . (1480) 
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<220> 

<221> CDS 

<222> (107) . . (787) 



<400> 23 

attcggcacg atgggtgaga acacacaatc aaaacttctc ctggaatata tcatctgaca 60 

tcagatcctt cccattgtct gtactgtttc tgctgccctg gaaacc atg caa gga 115 

Met Gin Gly 
1 

cag gaa cag acc acc atg gca gtg gtt cct gga ggt get cca cct tea 163 
Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala Pro Pro Ser 
5 10 15 

gag aat tct gtt atg aaa tea caa atg tgg aac gag aac aag gag aaa 211 
Glu Asn Ser Val Met Lys Ser Gin Met Trp Asn Glu Asn Lys Glu Lys 
20 25 30 35 

ttc ttg aag ggg gaa ccc aaa gtc ctt ggg gtt gta caa gtt atg att 259 
Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Val Gin Val Met He 
40 45 50 

get etc ata aac etc age ttc gga ata ata att ttg gca aat eta tct 307 
Ala Leu He Asn Leu Ser Phe Gly He He He Leu Ala Asn Leu Ser 
55 60 65 



tct gaa cca 
Ser Glu Pro 
70 

ata atg ttc 
He Met Phe 
85 

cct aca aaa 
Pro Thr Lys 
100 

tct gtg ttg 
Ser Val Leu 



gtg ggt tea 
Val Gly Ser 



ttg gat att 
Leu Asp He 
150 

gtg tec ate 
Val Ser He 
165 

gag gtt ctt 



etc att tct 
Leu He Ser 



att gtc tea 
He Val Ser 



aag ctg ate 
Lys Leu lie 
105 

get gca act 
Ala Ala Thr 
120 

cag ttt ccg 
Gin Phe Pro 
135 

ttg atg tta 
Leu Met Leu 



tct get ttt 
Ser Ala Phe 



gta gtg eta 



gtg gtc tta 
Val Val Leu 
75 

gga tec ctg 
Gly Ser Leu 
90 

ate age agt 
He Ser Ser 



gca age ata 
Ala Ser He 



ttt egg tat 
Phe Arg Tyr 
140 

att tta aat 
He Leu Asn 
155 

99^ tgt aaa 
Gly Cys Lys 
170 

cca tea aat 



atg get cca 
Met Ala Pro 



tea att gca 
Ser He Ala 
95 

eta act ctg 
Leu Thr Leu 
110 

atg ggt gta 
Met Gly Val 
125 

aat tat aca 
Asn Tyr Thr 



atg eta gaa 
Met Leu Glu 



get tec tgt 
Ala Ser Cys 
175 

cct get gtg 



att tgg gga 
He Trp Gly 
80 

gca gga gtg 
Ala Gly Val 



aac act ate 
Asn Thr lie 



gtc agt gtg 
Val Ser Val 
130 

ate acc aag 
He Thr Lys 
145 

ttc tgc att 
Phe Cys He 
160 

tgt aac tec 
Cys Asn Ser 



act gtg atg 



cca 355 
Pro 



aga 403 
Arg 



acc 451 

Thr 

115 

get 499 
Ala 



ggt 547 
Gly 



get 595 
Ala 



age 643 
Ser 



gca 691 
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Glu Val Leu Val Val Leu Pro Ser Asn Pro Ala Val Thr Val Met Ala 

180 185 190 195 

ccc cct gtg acg ctt caa cca ttg cca cca tea gaa cac caa ggg aaa 739 

Pro Pro Val Thr Leu Gin Pro Leu Pro Pro Ser Glu His Gin Gly Lys 
200 205 210 

aat gtt cca gaa aat gta tat aag aac cac tea gaa gaa ata gtc taa 787 

Asn Val Pro Glu Asn Val Tyr Lys Asn His Ser Glu Glu lie Val 





215 


220 


225 




ttctcttgtg 


tgtatgtgaa 


tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgta 


tgcatttccc 


847 


tgggaaatta 


acacttcatt 


gccctggctt 


ttggggggaa 


tatcagacat 


gecataaett 


907 


tgtaaatcta 


acatctacta 


gtacaacaac 


actttaacat 


cctccattaa 


ttgtatgtac 


967 


aaggggcaat 


gaattagcaa 


agatgttttg 


aaagcaaaat 


gaaacaaaca 


aacaagecag 


102 7 


cacacaaaaa 


aagacctgtt 


ff „f Off 1.1.- 

ttgecttteg 


caaagattaa 


agaaaggegt 


tcacacctct 


1087 


ttggggcttt 


tgctaaataa 


gtggatttag 


ctgcttttga 


tcatgtacaa 


acacatgaat 


1147 


tatgtatatt 


atattactaa 


tgataggaac 


caatttttat 


ctctgtgcat 


attcatcata 


1207 


taataacata 


ctctatactg 


taattgatat 


ggtaaaaatt 


ataataaaga 


gaagtgactg 


1267 


aaatcggaat 


acacaaacac 


acacacacac 


acacatttac 


atacatagtg 


tatatctctg 


1327 


tatatacata 


gtacgtgtat 


ggtgtgcatg 


tagagattat 


agaaggtata 


catacacata 


1387 


tgtgtacata 


ttacatatga 


cctaaaaaat 


cacttataac 


tgtggtttgg 


tgctgtaatc 


1447 


tcccctaagg 


ctcacatctt 


tgaacagttg 


gtc 






1480 



<210> 24 

<211> 226 

<212> PRT 

<213> Mus musculus 

<400> 24 

Met Gin Gly Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala 
15 10 15 



Pro Pro Ser Glu Asn Ser Val Met Lys Ser Gin Met Trp Asn Glu Asn 
20 25 30 



Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Val Gin 
35 40 45 



Val Met lie Ala Leu He Asn Leu Ser Phe Gly He He He Leu Ala 
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50 55 60 



Asn Leu Ser Ser Glu Pro Leu lie Ser Val Val Leu Met Ala Pro lie 
65 70 75 80 



Trp Gly Pro lie Met Phe lie Val Ser Gly Ser Leu Ser lie Ala Ala 
85 90 95 



Gly Val Arg Pro Thr Lys Lys Leu lie lie Ser Ser Leu Thr Leu Asn 
100 105 110 



Thr lie Thr Ser Val Leu Ala Ala Thr Ala Ser lie Met Gly Val Val 
115 120 125 



Ser Val Ala Val Gly Ser Gin Phe Pro Phe Arg Tyr Asn Tyr Thr lie 
130 135 140 



Thr Lys Gly Leu Asp lie Leu Met Leu lie Leu Asn Met Leu Glu Phe 
145 150 155 160 



Cys lie Ala Val Ser lie Ser Ala Phe Gly Cys Lys Ala Ser Cys Cys 
165 170 175 



Asn Ser Ser Glu Val Leu Val Val Leu Pro Ser Asn Pro Ala Val Thr 
180 185 190 



Val Met Ala Pro Pro Val Thr Leu Gin Pro Leu Pro Pro Ser Glu His 
195 200 205 



Gin Gly Lys Asn Val Pro Glu Asn Val Tyr Lys Asn His Ser Glu Glu 
210 215 220 



He Val 
225 



<210> 25 

<211> 1187 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 
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<222> (1) . . (1187) 
<220> 

<221> CDS 

<222> (36) . . (713) 



<400> 25 

ccattgtctg tactgtttct gctgccctgg aaacc atg caa gga ctg gca cag 53 

Met Gin Gly Leu Ala Gin 
1 5 

acc acc atg gca gtg gtt cct gga ggt get cca cct tea gag aat tct 101 
Thr Thr Met Ala Val Val Pro Gly Gly Ala Pro Pro Ser Glu Asn Ser 
10 15 20 

gtt ata aaa tea caa atg tgg aac aag aac aaa gag aaa ttc ttg aag 149 
Val lie Lys Ser Gin Met Trp Asn Lys Asn Lys Glu Lys Phe Leu Lys 
25 30 35 

ggg gaa ccc aaa gtc etc ggg get ata caa gtt atg att get ttc ata 197 
Gly Glu Pro Lys Val Leu Gly Ala He Gin Val Met He Ala Phe He 
40 45 50 

aac ttc age tta gga ata ata att ata tta aat aga gtt tct gaa cga 245 
Asn Phe Ser Leu Gly He He He He Leu Asn Arg Val Ser Glu Arg 
55 60 65 70 

ttc atg tea gtg etc tta ctg gee cca ttt tgg gga tea ata atg ttc 293 
Phe Met Ser Val Leu Leu Leu Ala Pro Phe Trp Gly Ser He Met Phe 
75 80 85 

att ttc tea gga tec ctg tea att gca gca gga gtg aaa cct aca aaa 341 
He Phe Ser Gly Ser Leu Ser He Ala Ala Gly Val Lys Pro Thr Lys 
90 95 100 

gee atg ate ate age agt eta agt gtg aac act ate agt tct gtg ttg 389 
Ala Met He He Ser Ser Leu Ser Val Asn Thr He Ser Ser Val Leu 
105 110 115 

get gtg gca gca age att att ggc gtc ate agt gtg att tct ggt gtt 437 
Ala Val Ala Ala Ser He He Gly Val He Ser Val He Ser Gly Val 
120 125 130 



ttt cgc caa ttt aga agt caa cca gee ate get agt ttg gat gtt ttg 485 
Phe Arg Gin Phe Arg Ser Gin Pro Ala He Ala Ser Leu Asp Val Leu 
135 140 145 150 

atg aca att ttg aat atg eta gaa ttc tgc att get gtg tec gtc tct 533 
Met Thr He Leu Asn Met Leu Glu Phe Cys He Ala Val Ser Val Ser 
155 160 165 

gca ttt ggg tgt aaa get tec tgt tgt aac tec agt gag gtt ctt gta 581 
Ala Phe Gly Cys Lys Ala Ser Cys Cys Asn Ser Ser Glu Val Leu Val 
170 175 180 
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gtg eta cca tea aat tct get gtg aca gtg aca gca ccc ccc atg ata 62 
Val Leu Pro Ser Asn Ser Ala Val Thr Val Thr Ala Pro Pro Met lie 
185 190 195 

ctt caa cca ttg cca cca tea gaa tgc caa ggg aaa aat gtt cca gaa 67 
Leu Gin Pro Leu Pro Pro Ser Glu Cys Gin Gly Lys Asn Val Pro Glu 
200 205 210 

aat eta tac agg aac caa cca gga gaa ata gtc taa ttttgatgta 72 
Asn Leu Tyr Arg Asn Gin Pro Gly Glu lie Val 



78 
84 
90 
96 
102 
108 
114 
118 



215 


220 


225 






cgtgttttca 


tgtgtgagag 


tgtgtgtgtg 


cacatgtgta 


tgcatttggc 


tttttgtacg 


aatataagac 


atgctctaaa 


taagtaagtc 


aagcatttat 


taagtcaata 


acactttaaa 


atcttctatt 


cattgtatgt 


acaagaagaa 


tttaattggc 


aaaaaaaatt 


ttaatgtaaa 


ctgataaaaa 


ccttggttta 


gtgaggtgaa 


gtctccaaag 


agtgaagaat 


ggagttcatg 


ggtttttttt 


gggggggggt 


tgctaaataa 


ataaatttgg 


ctgctcacga 


tcatatacag 


acacataaac 


tatgtaagac 


agcaggtgta 


ttaccttact 


aataatagca 


accaatttta 


atcctacatg 


aatgttcatc 


ttaatagtat 


atgccttaat 


tgttatggtg 


aaaagttaca 


ataaacaaaa 


gtgattgaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaag 





<210> 26 

<211> 225 

<212> PRT 

<213> Mus musculus 

<400> 26 

Met Gin Gly Leu Ala Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala 
15 10 15 



Pro Pro Ser Glu Asn Ser Val lie Lys Ser Gin Met Trp Asn Lys Asn 
20 25 30 



Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Ala lie Gin 
35 40 45 



Val Met lie Ala Phe lie Asn Phe Ser Leu Gly lie lie lie lie Leu 
50 55 60 



Asn Arg Val Ser Glu Arg Phe Met Ser Val Leu Leu Leu Ala Pro Phe 
65 70 75 80 
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Trp Gly Ser lie Met Phe lie Phe Ser Gly Ser Leu Ser lie Ala Ala 
85 90 95 



Gly Val Lys Pro Thr Lys Ala Met lie lie Ser Ser Leu Ser Val Asn 
100 105 110 



Thr He Ser Ser Val Leu Ala Val Ala Ala Ser He He Gly Val He 
115 120 125 



Ser Val He Ser Gly Val Phe Arg Gin Phe Arg Ser Gin Pro Ala He 
130 135 140 



Ala Ser Leu Asp Val Leu Met Thr He Leu Asn Met Leu Glu Phe Cys 
145 150 155 160 



He Ala Val Ser Val Ser Ala Phe Gly Cys Lys Ala Ser Cys Cys Asn 
165 170 175 



Ser Ser Glu Val Leu Val Val Leu Pro Ser Asn Ser Ala Val Thr Val 
180 185 190 



Thr Ala Pro Pro Met He Leu Gin Pro Leu Pro Pro Ser Glu Cys Gin 
195 200 205 



Gly Lys Asn Val Pro Glu Asn Leu Tyr Arg Asn Gin Pro Gly Glu He 
210 215 220 



Val 
225 



<210> 27 

<211> 1687 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1) . . (1687) 
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<220> 

<221> CDS 

<222> (133) . . (867) 



<400> 27 

gttacaagtt cacaagaagg aaccaaggat cagcctgaga gaacccagag ttaaggctct 60 

tcgggttcct gagagctcgg ctggaagtga ctgggtgaca aggcacacag gctcagccgt 120 

gggagctcaa tc atg att cca caa gta gtg acc agt gag act gtg gca atg 171 
Met He Pro Gin Val Val Thr Ser Glu Thr Val Ala Met 
15 10 

att teg cca aat gga atg agt ctt ccc caa aca gac aaa ccc cag cct 219 
He Ser Pro Asn Gly Met Ser Leu Pro Gin Thr Asp Lys Pro Gin Pro 
15 20 25 

ttc cac cag tgg caa gac age ctg aag aaa cat eta aag get gag ate 267 
Phe His Gin Trp Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu He 
30 35 " 40 45 

aaa gtg atg gcg gca ate cag ate atg tgt get gtg atg gtg ttg agt 315 
Lys Val Met Ala Ala He Gin He Met Cys Ala Val Met Val Leu Ser 
50 55 60 

ctg gga ate att ttg gca tct gtt ccc tec aat eta cac ttt acc tea 363 
Leu Gly He lie Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser 
65 70 75 

gtg ttt tea gtc ctg tta aaa tct ggc tac cca ttt ata gga get ttg 411 
Val Phe Ser Val Leu Leu Lys Ser Gly Tyr Pro Phe He Gly Ala Leu 
80 85 90 

ttt ttt ata gtc tct gga att ctg tec ate gtc acg gag aca aag tea 459 
Phe Phe He Val Ser Gly lie Leu Ser He Val Thr Glu Thr Lys Ser 
95 100 105 

aca aaa att ttg gta gac age age ctg act ctg aat ate ctg agt gtt 507 
Thr Lys He Leu Val Asp Ser Ser Leu Thr Leu Asn He Leu Ser Val 
110 115 120 125 

tea ttt get ttc atg ggc ate att ate ate tct gtc age ctg get ggt 555 
Ser Phe Ala Phe Met Gly He He He He Ser Val Ser Leu Ala Gly 
130 135 140 

ttg cat cct gec tea gag cag tgc ttg cag age aag gag ctt aga cca 603 
Leu His Pro Ala Ser Glu Gin Cys Leu Gin Ser Lys Glu Leu Arg Pro 
145 150 155 

act gaa tat cat tac tac caa ttc ttg gac agg aac gag tgc ttt gee 651 
Thr Glu Tyr His Tyr Tyr Gin Phe Leu Asp Arg Asn Glu Cys Phe Ala 
160 165 170 

gee aag tct gtt ctg get gga gtc ttt tea ctg atg ctg ate agt act 699 
Ala Lys Ser Val Leu Ala Gly Val Phe Ser Leu Met Leu He Ser Thr 
175 180 185 
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atg ttg gaa ctt ggc ctg get gtc etc act gec atg ctg tgg tgg aaa 
Met Leu Glu Leu Gly Leu Ala Val Leu Thr Ala Met Leu Trp Trp Lys 
190 195 200 205 



747 



cag agt cac tct aac ate cct ggg aat gtt atg ttc ctg cca cat age 
Gin Ser His Ser Asn lie Pro Gly Asn Val Met Phe .Leu Pro His Ser 
210 215 220 



795 



tea aat aat gac tec aac atg gaa tea aag gta ctt tgt aac ccc tea 
Ser Asn Asn Asp Ser Asn Met Glu Ser Lys Val Leu Cys Asn Pro Ser 
225 230 235 



843 



tat gag gaa caa ttg gtt tgt taa gaaaaacaaa acaaaacaaa actaaatacc 
Tyr Glu Glu Gin Leu Val Cys 
240 



897 



acaaaaacaa 



atggaactat acegcagaag atatgtcttc atgataatgc agaaattcca 



957 



accatcacag ggtagcaatg ettgetaett aaaatgtaga ctgttcatac agtgggtacc 1017 

agtatgagtt gaatgtgtgt attactggca ccctattgat tttcatgacc ttggcttcag 1077 

ccaaagccca gacctacaaa tggtggcctt tcttagaaaa ccaaacagaa tgtttcaggc 1137 

cattgtagtg gggaaaaggg acaacatttt cttgccccct gcaattaaca gcaacagtta 1197 

accattagca gtctttgtat tcagaatctc tgctatgacg tgggtcttct atcatatcag 1257 

ttttattcac tggtgtgaat aaacaaggga ectgeaatga agtctgaaga agtttccatg 1317 

ttgttggttc aaaataaagt ctttttgtga ttgtatattc tttttgtatg gggttttgtg 1377 

tgggttttgt tgttgttgtt gttttgcttt tatgeacaga tcaagtccta agtggtacat 1437 

aaacacatgt gtttgetatt ttttttttgt tctctatcaa ccaaaaaaaa aaaaaaaaaa 1497 

ggcatgggaa agagaaaaag gaatatatct gtgtctctgt gtttatgtgg tgtgtgtgtg 1557 

tgtgtgtttt aaactaaaac tgaatgaaca ttcaaagttt agcaatgtat tttggaggta 1617 

ccagataaca tacttatttc tagaatcgee agagcaaatg agagacattt tgtegcttet 1677 

tataagaggc 168 7 

<210> 28 

<211> 244 

<212> PRT 

<213 > Mus mus cuius 

<400> 28 

Met He Pro Gin Val Val Thr Ser Glu Thr Val Ala Met He Ser Pro 
15 10 15 
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Asn Gly Met Ser Leu Pro Gin Thr Asp Lys Pro Gin Pro Phe His Gin 
20 25 30 



Trp Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu lie Lys Val Met 
35 40 45 



Ala Ala lie Gin lie Met Cys Ala Val Met Val Leu Ser Leu Gly lie 
50 55 60 



lie Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser 
65 70 75 80 



Val Leu Leu Lys Ser Gly Tyr Pro Phe lie Gly Ala Leu Phe Phe lie 
85 90 95 



Val Ser Gly lie Leu Ser lie Val Thr Glu Thr Lys Ser Thr Lys lie 
100 105 110 



Leu Val Asp Ser Ser Leu Thr Leu Asn lie Leu Ser Val Ser Phe Ala 
115 120 125 



Phe Met Gly lie lie He He Ser Val Ser Leu Ala Gly Leu His Pro 
130 135 140 



Ala Ser Glu Gin Cys Leu Gin Ser Lys Glu Leu Arg Pro Thr Glu Tyr 
145 150 155 160 



His Tyr Tyr Gin Phe Leu Asp Arg Asn Glu Cys Phe Ala Ala Lys Ser 
165 170 175 



Val Leu Ala Gly Val Phe Ser Leu Met Leu He Ser Thr Met Leu Glu 
180 185 190 



Leu Gly Leu Ala Val Leu Thr Ala Met Leu Trp Trp Lys Gin Ser His 
195 200 205 



Ser Asn He Pro Gly Asn Val Met Phe Leu Pro His Ser Ser Asn Asn 
210 215 220 



Asp Ser Asn Met Glu Ser Lys Val Leu Cys Asn Pro Ser Tyr Glu Glu 
225 230 235 240 



Gin Leu Val Cys 
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<210> 29 

<211> 899 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1) . . (899) 

<220> 

<221> CDS 

<222> (164) . . (817) 



<400> 29 

aattcggcac taggaggaag ggctggagac agttacaagt tcacaagaag gaaccaagga 60 

tcagcctgag agaacccaga gttaaggctc ttagggtttc tgagagctcg gctggaagtg 12 0 

actgggtgac aaggcacaca ggctcagctg tggcagctcc ate atg att cca cag 175 

Met lie Pro Gin 
1 

gta gtg acc aat gag acc ate aca acg att tea cca aat gga ate aac 223 
Val Val Thr Asn Glu Thr lie Thr Thr lie Ser Pro Asn Gly lie Asn 
5 10 15 20 

ttt ccc caa aaa gac gag tec cag cct acc caa cag agg caa gac age 271 
Phe Pro Gin Lys Asp Glu Ser Gin Pro Thr Gin Gin Arg Gin Asp Ser 
25 30 35 

ctg aag aaa cat eta aag get gag ate aaa gtg ata gtg gca ate cag 319 
Leu Lys Lys His Leu Lys Ala Glu lie Lys Val lie Val Ala lie Gin 
40 45 50 

ate atg tgt get gtg aca gtg ttg get ctg gga ate att ttg gca tct 367 
lie Met Cys Ala Val Thr Val Leu Ala Leu Gly lie lie Leu Ala Ser 
55 60 65 

gtt cct cct gtc cca tat ttt aac tea gtg ttt tct gtc ctg tta aaa 415 
Val Pro Pro Val Pro Tyr Phe Asn Ser Val Phe Ser Val Leu Leu Lys 
70 75 80 

tct ggc tac cca ttt ata gga get ttg ttt ttt att gcc tct gga att 463 
Ser Gly Tyr Pro Phe lie Gly Ala Leu Phe Phe lie Ala Ser Gly lie 
85 90 95 100 

ttg tec ate att acg gag aga aag tea aca aaa cct ttg gta gat gcc 511 
Leu Ser lie lie Thr Glu Arg Lys Ser Thr Lys Pro Leu Val Asp Ala 
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105 110 115 

age eta act ctg aat ate ctg agt gtt tea ttt get ttc gtg ggc ate 559 
Ser Leu Thr Leu Asn lie Leu Ser val Ser Phe Ala Phe Val Gly lie 
120 125 130 

att ate ate tct gtc age ctg get ggt ttg cat cct gee tea gag caa 607 
lie lie lie Ser Val Ser Leu Ala Gly Leu His Pro Ala Ser Glu Gin 
135 140 145 

tgc aag cag age aag gag ctt agt eta att gaa cat gac tac tac caa 655 
Cys Lys Gin Ser Lys Glu Leu Ser Leu lie Glu His Asp Tyr Tyr Gin 
150 155 160 

cct ttc tac aac tea gac agg agt gaa tgt gee gtc ace aag tct att 703 
Pro Phe Tyr Asn Ser Asp Arg Ser Glu Cys Ala Val Thr Lys Ser lie 
165 170 175 180 

ctg act gga gec ctt tea gtg atg ctg ate ate agt gtg ttg gag ctt 751 
Leu Thr Gly Ala Leu Ser Val Met Leu lie lie Ser Val Leu , Glu Leu 
185 190 195 

ggt ctg get ttg etc tct gee atg ctg tgg ttg aga gag ggt gtt ctg 799 
Gly Leu Ala Leu Leu Ser Ala Met Leu Trp Leu Arg Glu Gly Val Leu 
200 205 210 



act tec ctt aga atg tga ttttttctgt cttataactc caataatgaa 847 
Thr Ser Leu Arg Met 
215 

gccaatgtgg aatcaaaggc actttgtaac tgtgcatatg aggaacgatt gt 899 



<210> 30 

<211> 217 

<212> PRT 

<213> Mus musculus 

<400> 30 

Met lie Pro Gin Val Val Thr Asn Glu Thr lie Thr Thr lie Ser Pro 
15 10 15 



Asn Gly lie Asn Phe Pro Gin Lys Asp Glu Ser Gin Pro Thr Gin Gin 
20 25 30 



Arg Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu He Lys Val lie 
35 40 45 



Val Ala He Gin He Met Cys Ala Val Thr Val Leu Ala Leu Gly He 
50 55 60 
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lie Leu Ala Ser Val Pro Pro Val Pro Tyr Phe Asn Ser Val Phe Ser 
65 70 75 80 



Val Leu Leu Lys Ser Gly Tyr Pro Phe lie Gly Ala Leu Phe Phe lie 
85 90 95 



Ala Ser Gly lie Leu Ser lie lie Thr Glu Arg Lys Ser Thr Lys Pro 
100 105 110 



Leu Val Asp Ala Ser Leu Thr Leu Asn lie Leu Ser Val Ser Phe Ala 
115 120 125 



Phe Val Gly He He He He Ser Val Ser Leu Ala Gly Leu His Pro 
130 135 140 



Ala Ser Glu Gin Cys Lys Gin Ser Lys Glu Leu Ser Leu He Glu His 
145 150 155 160 



Asp Tyr Tyr Gin Pro Phe Tyr Asn Ser Asp Arg Ser Glu Cys Ala Val 
165 170 175 



Thr Lys Ser lie Leu Thr Gly Ala Leu Ser Val Met Leu He He Ser 
180 185 190 



Val Leu Glu Leu Gly Leu Ala Leu Leu Ser Ala Met Leu Trp Leu Arg 
195 200 205 



Glu Gly Val Leu Thr Ser Leu Arg Met 
210 215 



<210> 31 

<211> 1370 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDRA 

<222> (1) . . (1370) 

<220> 
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<221> CDS 

<222> (110) . . (853) 



<400> 31 

caaggatcag actgagaaaa cccatagtta aagctcttgg ggtttctgag agctcagctg 60 

10 gaagtgactg ggtgacaagg cgcacaggct cagccgtgga agctccatc atg att cca 118 

Met lie Pro 
1 

caa gta gtg acc agt gag act gtc aca gtg att tea cca aat gga ate 166 
15 Gin Val Val Thr Ser Glu Thr Val Thr Val lie Ser Pro Asn Gly He 

5 10 15 

age ttt ccc caa aca gac aaa ccc cag cct tec cac cag age caa gac 214 
Ser Phe Pro Gin Thr Asp Lys Pro Gin Pro Ser His Gin Ser Gin Asp 
20 20 25 30 35 

age ctg aag aaa cat eta aag get gag ate aaa gtg atg gcg gca ate 262 

Ser Leu Lys Lys His Leu Lys Ala Glu He Lys Val Met Ala Ala He 

40 45 50 

25 

cag ate atg tgt get gtg atg gtg ttg agt ctg gga ate att ttg gca 310 

Gin He Met Cys Ala Val Met Val Leu Ser Leu Gly He He Leu Ala 
55 60 65 

30 tct gtt ccc tec aat eta cac ttt acc tea gtg ttt tec ate ctg tta 358 

Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser He Leu Leu 
70 75 80 

gaa tct ggc tac cca ttt gta gga get ttg ttt ttt gec ate tct gga 406 
35 Glu Ser Gly Tyr Pro Phe Val Gly Ala Leu Phe Phe Ala He Ser Gly 
85 90 95 

att ctg tct att gtc aca gag aaa aag atg act aag cct ttg gtt cac 454 
He Leu Ser He Val Thr Glu Lys Lys Met Thr Lys Pro Leu Val His 
40 100 105 110 115 

age age eta gec ctg age ate ctg agt gtc etc tct get ctt aca ggc 502 
Ser Ser Leu Ala Leu Ser He Leu Ser Val Leu Ser Ala Leu Thr Gly 
120 125 130 



45 



ate get att etc tct gtc agt ttg get get tta gag cct gec ttg cag 550 
He Ala He Leu Ser Val Ser Leu Ala Ala Leu Glu Pro Ala Leu Gin 
135 140 145 



50 caa tgt aag ctg get ttc aca caa eta gac aca acc caa gac get tat 598 

Gin Cys Lys Leu Ala Phe Thr Gin Leu Asp Thr Thr Gin Asp Ala Tyr 
150 155 160 

cat ttc ttt age cct gag cca tta aac age tgc ttc gtg gec aaa get 646 
55 His Phe Phe Ser Pro Glu Pro Leu Asn Ser Cys Phe Val Ala Lys Ala 

165 170 175 

get ctg act gga gtc ttt tea ctg atg eta ate age agt gtg ttg gag '694 
Ala Leu Thr Gly Val Phe Ser Leu Met Leu He Ser Ser Val Leu Glu 
60 180 185 190 195 
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ctt ggc ctg get gtc etc act gec aca ctg tgg tgg aaa cag age tec 742 
Leu Gly Leu Ala Val Leu Thr Ala Thr Leu Trp Trp Lys Gin Ser Ser 
200 205 210 

tct get ttc tct ggg aat gtg att ttc ctg tct cag aac tea aag aat 790 
Ser Ala Phe Ser Gly Asn Val lie Phe Leu Ser Gin Asn Ser Lys Asn 
215 220 225 

aaa tec agt gta tct tea gag tea ctt tgt aac cct aca tat gaa aac 838 
Lys Ser Ser Val Ser Ser Glu Ser Leu Cys Asn Pro Thr Tyr Glu Asn 
230 235 240 

ata ttg act tea taa gaattaagta gaggttatat agcagaaaaa tctgtcttta 893 
lie Leu Thr Ser 
245 

953 
1013 
1073 
1133 
1193 
1253 
1313 
1370 



<210> 32 

<211> 247 

<212> PRT 

<213> Mus musculus 

<400> 32 

Met lie Pro Gin Val Val Thr Ser Glu Thr Val Thr Val lie Ser Pro 
15 10 15 



Asn Gly lie Ser Phe Pro Gin Thr Asp Lys Pro Gin Pro Ser His Gin 
20 25 30 



Ser Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu lie Lys Val Met 
35 40 45 



Ala Ala He Gin He Met Cys Ala Val Met Val Leu Ser Leu Gly He 
50 55 60 



acatgattta 


gaaaagecat 


ttactgtgtg 


acaacaatgc 


ttaatatctt 


gatatcttaa 


tgtgtgtatt 


ggttaatcag 


caaccatgaa 


aaacatacta 


actggctggg 


ttcagtagca 


cgctcttgat 


ttggtgtcag 


tcaaaacaca 


gacctgtaaa 


ttccaattta 


tgtagtggtc 


aaagagcccc 


aattattttc 


tcaaaaaact 


ggaagaatgt 


ttcataggat 


catggtggag 


ccaatgggca 


acagttcttc 


ttatccttgt 


cacttggctg 


caggaggtac 


tgactagggc 


ctgagatcat 


attctgtgtg 


cgtggcatgg 


acttcatggc 


atctatttta 


ttcataagca 


catgaaaaca 


agtcatctct 


tatgaagtct 


caaagagcat 


aaaaaagtta 


gcctccaaat 


aaagtcttta 


tataatcccc 


ccaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaa 
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lie Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser 
65 70 75 80 



lie Leu Leu Glu Ser Gly Tyr Pro Phe Val Gly Ala Leu Phe Phe Ala 
85 90 95 



lie Ser Gly lie Leu Ser lie Val Thr Glu Lys Lys Met Thr Lys Pro 
100 105 110 



Leu Val His Ser Ser Leu Ala Leu Ser lie Leu Ser Val Leu Ser Ala 
115 120 125 



Leu Thr Gly lie Ala lie Leu Ser Val Ser Leu Ala Ala Leu Glu Pro 
130 135 140 



Ala Leu Gin Gin Cys Lys Leu Ala Phe Thr Gin Leu Asp Thr Thr Gin 
145 150 155 160 



Asp Ala Tyr His Phe Phe Ser Pro Glu Pro Leu Asn Ser Cys Phe Val 
165 170 175 



Ala Lys Ala Ala Leu Thr Gly Val Phe Ser Leu Met Leu lie Ser Ser 
180 185 190 



Val Leu Glu Leu Gly Leu Ala Val Leu Thr Ala Thr Leu Trp Trp Lys 
195 200 205 



Gin Ser Ser Ser Ala Phe Ser Gly Asn Val lie Phe Leu Ser Gin Asn 
210 215 220 



Ser Lys Asn Lys Ser Ser Val Ser Ser Glu Ser Leu Cys Asn Pro Thr 
225 230 235 240 



Tyr Glu Asn lie Leu Thr Ser 
245 



<210> 33 

<211> 938 

<212> DNA 

<213> Mus musculus 



-45- 



WO 02/062946 



PCT/US01/48437 



10 



<220> 

<221> CDNA 
<222> (1) . . (938) 



<220> 
<221> CDS 
<222> (105) . . (809) 

15 

<400> 33 

aacagctggt aagaggttga gtctgagttc cccaggagca gagtgagcag gcccaggagc 60 

20 tgcagcacac aggacagggc tgtgaggaca gcatccctgg cacc atg cgt eta cag 116 

Met Arg Leu Gin 
1 

ctt ggc acc aag aac att ggg tgg gac tgc ttt cca aag gac ate att 164 
25 Leu Gly Thr Lys Asn lie Gly Trp Asp Cys Phe Pro Lys Asp lie lie 
5 10 15 20 

ate cac aaa aga gag aaa act gga cat aca tat gaa aaa gaa gat gac 212 
lie His Lys Arg Glu Lys Thr Gly His Thr Tyr Glu Lys Glu Asp Asp 
30 25 30 35 

ctg ctg att gga gtg cct agt gaa gee aca ctt ctt gga acc ate cag 260 
Leu Leu lie Gly Val Pro Ser Glu Ala Thr Leu Leu Gly Thr lie Gin 
40 45 50 



35 

eta eta tgt gee ctg ata ctt gca age ttt ggg ggc att ttg gtg tea 308 
Leu Leu Cys Ala Leu lie Leu Ala Ser Phe Gly Gly lie Leu Val Ser 
55 60 65 

40 get tec tac ttc aac cca gaa gtt tec acc act ttg ata tct gga tac 356 

Ala Ser Tyr Phe Asn Pro Glu Val Ser Thr Thr Leu lie Ser Gly Tyr 
70 75 80 

ctg ttt att gga tec etc tgt ttt gee att get gga att etc tec att 404 
45 Leu Phe lie Gly Ser Leu Cys Phe Ala lie Ala Gly lie Leu Ser lie 

85 90 95 100 

ate tct gaa aaa ate tea acc aaa ccc ttt gca ctg age age ctg gee 452 
lie Ser Glu Lys lie Ser Thr Lys Pro Phe Ala Leu Ser Ser Leu Ala 
50 105 110 115 

tec aat gta gca age tct gtt gtt gee gtc att ggc etc ttc etc ttc 500 
Ser Asn Val Ala Ser Ser Val Val Ala Val lie Gly Leu Phe Leu Phe 
120 125 130 



55 



acc tat tgt ctg ata gec ctg ggg agt get ttc cca cac tgt aac tea 548 
Thr Tyr Cys Leu lie Ala Leu Gly Ser Ala Phe Pro His Cys Asn Ser 
135 140 145 



60 gaa aag aaa ttc ctg tec tta ttg tct tac ttg aaa tec cac cat tgg 596 
Glu Lys Lys Phe Leu Ser Leu Leu Ser Tyr Leu Lys Ser His His Trp 
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150 155 160 



aag aat gaa gac aag aac tgt tac ctg get tat gtc ggt get atg agt 644 
Lys Asn Glu Asp Lys Asn Cys Tyr Leu Ala Tyr Val Gly Ala Met Ser 
165 170 175 180 

get ctg ggg atg atg ctt etc ttc act gtg ctg gag gtt ttc eta get 692 
Ala Leu Gly Met Met Leu Leu Phe Thr Val Leu Glu Val Phe Leu Ala 
185 190 195 

gga tac agt tct ate ttt tgg tgg aag caa gtc tac tec aac aaa cct 740 
Gly Tyr Ser Ser lie Phe Trp Trp Lys Gin Val Tyr Ser Asn Lys Pro 
200 205 210 

999 ggt aca ttt ttc ttg cct caa tea caa gat cat ace caa ctt gtc 788 
Gly Gly Thr Phe Phe Leu Pro Gin Ser Gin Asp His Thr Gin Leu Val 
215 220 225 

aaa age aat ttg ttg caa taa tggatatgaa ataaaaaaga agaaagaaga 83 9 

Lys Ser Asn Leu Leu Gin 
230 

aatctgttgc tcacagcaga gtacagctag acttgectga aatctcagct ttggtagatc 899 
ctaataaaca tctgaagaaa aaaaaaaaaa aaaaaaaaa 93 8 



<210> 34 

<211> 234 

<212> PRT 

<213> Mus musculus 

<400> 34 

Met Arg Leu Gin Leu Gly Thr Lys Asn lie Gly Trp Asp Cys Phe Pro 
15 10 15 



Lys Asp lie lie lie His Lys Arg Glu Lys Thr Gly His Thr Tyr Glu 
20 25 30 



Lys Glu Asp Asp Leu Leu lie Gly Val Pro Ser Glu Ala Thr Leu Leu 
35 40 45 



Gly Thr lie Gin Leu Leu Cys Ala Leu lie Leu Ala Ser Phe Gly Gly 
50 55 60 



lie Leu Val Ser Ala Ser Tyr Phe Asn Pro Glu Val Ser Thr Thr Leu 
65 70 75 80 



lie Ser Gly Tyr Leu Phe lie Gly Ser Leu Cys Phe Ala lie Ala Gly 
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85 90 95 



lie Leu Ser lie lie Ser Glu Lys lie Ser Thr Lys Pro Phe Ala Leu 
100 105 110 



Ser Ser Leu Ala Ser Asn Val Ala Ser Ser Val Val Ala Val lie Gly 
115 120 125 



Leu Phe Leu Phe Thr Tyr Cys Leu lie Ala Leu Gly Ser Ala Phe Pro 
130 135 140 



His Cys Asn Ser Glu Lys Lys Phe Leu Ser Leu Leu Ser Tyr Leu Lys 
145 150 155 160 



Ser His His Trp Lys Asn Glu Asp Lys Asn Cys Tyr Leu Ala Tyr Val 
165 170 175 



Gly Ala Met Ser Ala Leu Gly Met Met Leu Leu Phe Thr Val Leu Glu 
180 185 190 



Val Phe Leu Ala Gly Tyr Ser Ser lie Phe Trp Trp Lys Gin Val Tyr 
195 200 205 



Ser Asn Lys Pro Gly Gly Thr Phe Phe Leu Pro Gin Ser Gin Asp His 
210 215 220 



Thr Gin Leu Val Lys Ser Asn Leu Leu Gin 
225 230 



<210> 35 

<211> 1025 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1) . . (1025) 

<220> 
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<221> CDS 

<222> (12) . . (818) 



<400> 35 

aattcgtcga c atg gaa ccg gag caa gag aga ctt act tgg caa ccg ggc 50 

Met Glu Pro Glu Gin Glu Arg Leu Thr Trp Gin Pro Gly 
15 10 

aca gta age atg aat aca gta act tea ccg ggc ccc atg gec aat tct 98 
Thr Val Ser Met Asn Thr Val Thr Ser Pro Gly Pro Met Ala Asn Ser 
15 20 25 

gta tat gtg gtg gca ccg ccc aac agt tat cct gtg gtc cca gga acc 146 
Val Tyr Val Val Ala Pro Pro Asn Ser Tyr Pro Val Val Pro Gly Thr 
30 35 40 45 

gtg cct cag atg cct att tat ccc age aat cag cct caa gtc cat gtg 194 
Val Pro Gin Met Pro lie Tyr Pro Ser Asn Gin Pro Gin Val His Val 
50 55 60 

att tct ggt cat ctg cct ggt ttg gtg cca get atg acc gaa cca cct 242 
lie Ser Gly His Leu Pro Gly Leu Val Pro Ala Met Thr Glu Pro Pro 
65 70 75 

gee cag aga gtc ttg aaa aag gga caa gtc eta ggg gec ate cag ate 290 
Ala Gin Arg Val Leu Lys Lys Gly Gin Val Leu Gly Ala lie Gin lie 
80 85 90 

ctg ate ggc ctg gtg cac ate ggc ctg ggc tec ate atg ate acc aac 33 8 

Leu lie Gly Leu Val His lie Gly Leu Gly Ser lie Met lie Thr Asn 
95 ~ 100 105 

etc ttc age cac tat aca cct gtc tec etc tat gga ggt ttc ccc ttc 386 
Leu Phe Ser His Tyr Thr Pro Val Ser Leu Tyr Gly Gly Phe Pro Phe 
110 115 120 125 

tgg gga ggc ate tgg ttt ate att tea gga tct etc tea gtg gcg gca 434 
Trp Gly Gly lie Trp Phe He He Ser Gly Ser Leu Ser Val Ala Ala 
130 135 140 

gaa act cag ccc aat tea cct tgc ctg ctg aat ggc agt gtg ggc ttg 482 
Glu Thr Gin Pro Asn Ser Pro Cys Leu Leu Asn Gly Ser Val Gly Leu 
145 150 155 

aac ate ttc age gee ate tgt tct gcg gtg ggc ate atg etc ttc ate 530 
Asn 'He Phe Ser Ala He Cys Ser Ala Val Gly He Met Leu Phe He 
160 165 170 

aca gac ate agt ate tea tea gga tac ate tac cct age tac tac cct 578 
Thr Asp He Ser He Ser Ser Gly Tyr He Tyr Pro Ser Tyr Tyr Pro 
175 180 185 

tac cag gag aac ttg ggt gtg aga aca ggc gtg get att tct agt gtg 62 6 

Tyr Gin Glu Asn Leu Gly Val Arg Thr Gly Val Ala He Ser Ser Val 
190 195 200 205 

eta etc ate ttc tgc etc ttg gag etc age att gca agt gtg tec tec 674 
Leu Leu He Phe Cys Leu Leu Glu Leu Ser He Ala Ser Val Ser Ser 
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210 215 220 

cac ttt ggc tgc caa gtg gcc tgt tgt cac tac aac aat cca ggt gtg 722 
His Phe Gly Cys Gin Val Ala Cys Cys His Tyr Asn Asn Pro Gly Val 
225 230 235 

gtc att cca aat gtc tat gca gca aac cca gtg gtc ate cca gaa cca 770 
Val lie Pro Asn Val Tyr Ala Ala Asn Pro Val Val lie Pro Glu Pro 
240 245 250 

cca aac cca ata cca agt tat tec gaa gta gtt caa gac tec aga taa 818 
Pro Asn Pro lie Pro Ser Tyr Ser Glu Val Val Gin Asp Ser Arg 
255 260 265 

gtgaacctga agattctaga gaccaagtga catcctctcc tacctagact cctataaacc 878 

aagttcttcc tttcctgacg aagggtaaat atctttcttg tggcctaaat tatagactct 93 8 

tgcttcaact caccctggaa aaatctctat taaaacgaga tgggagattg aaatggattc 998 

aaataaagat gctctagccg gaaaaaa 1025 

<210> 36 

<211> 268 

<212> PRT 

<213> Mus musculus 



<400> 36 

Met Gin Pro Glu Gin Glu Arg Leu Thr Trp Gin Pro Gly Thr Val Ser 
1 5 ~ 10 15 



Met Asn Thr Val Thr Ser Pro Gly Pro Met Ala Asn Ser Val Tyr Val 
20 25 30 



Val Ala Pro Pro Asn Ser Tyr Pro Val Val Pro Gly Thr Val Pro Gin 
35 40 45 



Met Pro lie Tyr Pro Ser Asn Gin Pro Gin Val His Val lie Ser Gly 
50 55 60 



His Leu Pro Gly Leu Val Pro Ala Met Thr Glu Pro Pro Ala Gin Arg 
65 70 75 80 



Val Leu Lys Lys Gly Gin Val Leu Gly Ala lie Gin lie Leu lie Gly 
85 90 95 



Leu Val His lie Gly Leu Gly Ser lie Met lie Thr Asn Leu Phe Ser 
100 105 110 
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His Tyr Thr Pro Val Ser Leu Tyr Gly Gly Phe Pro Phe Trp Gly Gly 
115 120 125 



lie Trp Phe lie lie Ser Gly Ser Leu Ser Val Ala Ala Glu Thr Gin 
130 135 140 



Pro Asn Ser Pro Cys Leu Leu Asn Gly Ser Val Gly Leu Asn lie Phe 
145 150 155 160 



Ser Ala lie Cys Ser Ala Val Gly He Met Leu Phe He Thr Asp He 
165 170 175 



Ser He Ser Ser Gly Tyr He Tyr Pro Ser Tyr Tyr Pro Tyr Gin Glu 
180 185 190 



Asn Leu Gly Val Arg Thr Gly Val Ala He Ser Ser Val Leu Leu He 
195 ~ 200 205 



Phe Cys Leu Leu Glu Leu Ser He Ala Ser Val Ser Ser His Phe Gly 
210 215 220 



Cys Gin Val Ala Cys Cys His Tyr Asn Asn Pro Gly Val Val He Pro 
225 230 235 240 



Asn Val Tyr Ala Ala Asn Pro Val Val He Pro Glu Pro Pro Asn Pro 
245 250 255 



He Pro Ser Tyr Ser Glu Val Val Gin Asp Ser Arg 
260 265 



<210> 37 

<211> 1038 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1) . . (1038) 



-51- 



WO 02/062946 



PCT/US01/48437 



<220> 

<221> CDS 

<222> (75) . . (878) 



<400> 37 

gctctggatg gtcaggcaca gctgaagcct acaacagagt tcccggggcc agccagggtt 60 

cccggccagg gtca atg get ggc caa get ccc aca gcg gtt ccc ggc agt 110 
Met Ala Gly Gin Ala Pro Thr Ala Val Pro Gly Ser 
15 10 

gtt act gga gaa gtc tea cga tgg cag aac eta gga cct gee cag cct 158 
Val Thr Gly Glu Val Ser Arg Trp Gin Asn Leu Gly Pro Ala Gin Pro 
15 20 25 

gca cag aaa gta gee cag ccc caa aac ctg gtt cca gat ggg cac ctt 206 
Ala Gin Lys Val Ala Gin Pro Gin Asn Leu Val Pro Asp Gly His Leu 
30 35 40 

gag aaa gec ctg gag gga agt gac ctt eta cag aag ttg ggt ggc ttt 254 
Glu Lys Ala Leu Glu Gly Ser Asp Leu Leu Gin Lys Leu Gly Gly Phe 
45 50 55 60 

cac ate gee att get ttt gca cac ttg gec ttt ggt ggc tac ttg ate 302 
His lie Ala lie Ala Phe Ala His Leu Ala Phe Gly Gly Tyr Leu lie 
65 70 75 

tec aca gtc aag aat ctg cac ctg gtg gtg ttg aag tgc tgg tat cca 350 
Ser Thr Val Lys Asn Leu His Leu Val Val Leu Lys Cys Trp Tyr Pro 
80 85 90 

etc tgg gga act gtc tct ttt etc gtt gca ggg atg gcg gee atg aca 398 
Leu Trp Gly Thr Val Ser Phe Leu Val Ala Gly Met Ala Ala Met Thr 
95 100 105 



aca gtg aca ttc cct aag ace tct ctg aag gtt etc tgt gtg ata gee 446 
Thr Val Thr Phe Pro Lys Thr Ser Leu Lys Val Leu Cys Val lie Ala 
110 115 120 

aac gtt ate age ttg ttc tgc gca ctg gee ggc ttc ttt gtc att gee 494 
Asn Val lie Ser Leu Phe Cys Ala Leu Ala Gly Phe Phe Val lie Ala 
125 130 135 140 

aag gac etc ttc ctg gag ggt cct ttt cca tgg cca ate tgg aga cca 542 
Lys Asp Leu Phe Leu Glu Gly Pro Phe Pro Trp Pro lie Trp Arg Pro 
145 150 155 

tac cct gaa ccc aca ace tac ate caa agg eta gag ctg ace ctg ttc 590 
Tyr Pro Glu Pro Thr Thr Tyr lie Gin Arg Leu Glu Leu Thr Leu Phe 
160 165 170 

tgc ttc ace ttc ctg gag ate ttc ctg teg ggg tec aca gee ate aca 638 
Cys Phe Thr Phe Leu Glu lie Phe Leu Ser Gly Ser Thr Ala lie Thr 
175 180 185 

gec tat agg atg aaa cgc ctg caa gca gag gac aag gat gac ace cct 686 
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Ala Tyr Arg Met Lys Arg Leu Gin Ala Glu Asp Lys Asp Asp Thr Pro 
190 195 200 

ttc gtt cct gac aca cct atg gaa ctt aaa ggt eta teg ctg ggg cca 734 
Phe Val Pro Asp Thr Pro Met Glu Leu Lys Gly Leu Ser Leu Gly Pro 
205 210 215 220 

cca cca tec tat aaa gat gtg gec caa gga cac tec tec tct gat act 782 
Pro Pro Ser Tyr Lys Asp Val Ala Gin Gly His Ser Ser Ser Asp Thr 
225 230 235 

ggc aga gec ttg gec acc age tct gga ctg ctg ctg gee tct gac tec 830 
Gly Arg Ala Leu Ala Thr Ser Ser Gly Leu Leu Leu Ala Ser Asp Ser 
240 245 250 

ttc cat caa get ctg ctt cac acg ggc cca agg acc ttg aga aag tag 878 
Phe His Gin Ala Leu Leu His Thr Gly Pro Arg Thr Leu Arg Lys 
255 260 265 

tctaagacta ctagactttc ccaataaaga ctgagttcct geaatgecaa gtccttccct 93 8 

gcctcccatt ggctggactc eggtgatgae acaggcgtga ettgeacaaa taaatgeata 998 

accactatga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 103 8 

<210> 38 

<211> 267 

<212> PRT 

<213> Mus musculus 



<400> 38 

Met Ala Gly Gin Ala Pro Thr Ala Val Pro Gly Ser Val Thr Gly Glu 
1 5 10 15 



Val Ser Arg Trp Gin Asn Leu Gly Pro Ala Gin Pro Ala Gin Lys Val 
20 25 30 



Ala Gin Pro Gin Asn Leu Val Pro Asp Gly His Leu Glu Lys Ala Leu 
35 40 45 



Glu Gly Ser Asp Leu Leu Gin Lys Leu Gly Gly Phe His lie Ala lie 
50 55 60 



Ala Phe Ala His Leu Ala Phe Gly Gly Tyr Leu lie Ser Thr Val Lys 
65 70 75 80 



Asn Leu His Leu Val Val Leu Lys Cys Trp Tyr Pro Leu Trp Gly Thr 
85 90 95 
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Val Ser Phe Leu Val Ala Gly Met Ala Ala Met Thr Thr Val Thr Phe 
100 105 110 



Pro Lys Thr Ser Leu Lys Val Leu Cys Val lie Ala Asn Val lie Ser 
115 120 125 



Leu Phe Cys Ala Leu Ala Gly Phe Phe Val lie Ala Lys Asp Leu Phe 
130 135 140 



Leu Glu Gly Pro Phe Pro Trp Pro He Trp Arg Pro Tyr Pro Glu Pro 
145 150 155 160 



Thr Thr Tyr He Gin Arg Leu Glu Leu Thr Leu Phe Cys Phe Thr Phe 
165 170 175 



Leu Glu He Phe Leu Ser Gly Ser Thr Ala He Thr Ala Tyr Arg Met 
180 185 190 



Lys Arg Leu Gin Ala Glu Asp Lys Asp Asp Thr Pro Phe Val Pro Asp 
195 200 205 



Thr Pro Met Glu Leu Lys Gly Leu Ser Leu Gly Pro Pro Pro Ser Tyr 
210 215 220 



Lys Asp Val Ala Gin Gly His Ser Ser Ser Asp Thr Gly Arg Ala Leu 
225 230 235 240 



Ala Thr Ser Ser Gly Leu Leu Leu Ala Ser Asp Ser Phe His Gin Ala 
245 250 255 



Leu Leu His Thr Gly Pro Arg Thr Leu Arg Lys 
260 265 



<210> 39 

<211> 1146 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1146) 
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<220> 

<221> CDS 

<222> (134) . . (1027) 
<300> 

<301> Tedder, TF, Klejman, G, Schlossman, SF, Saito, H 

<3 02> Structure of the gene encoding the human B lymphocyte 
differentiation antigen CD20 (Bl) 

<303> J Immunol 

<304> 142 

<305> 7 

<306> 2560-2568 

<307> 1989- - 

<309> - - 

<300> 

<301> Tedder, TF, Streuli, M, Schlossman, SF, and Saito H 

<302> Isolation and structure of a cDNA encoding the Bl (CD20) cell- 
surface antigen of human B lymphocytes 

<303> Proc Natl Acad Sci USA 

<304> 85 

<305> 1 

<30.6> 208-212 

<307> 1988- - 

<309> - - 



<400> 39 

cctcaatgac actcatggag gaaatgctga gagaagcatt cagatgcatg acacaaggta 60 

agactgccaa aaatcttgtt cttgctctcc tcattttgtt atttgtttta tttttaggag 120 

ttttgagagc aaa atg aca aca ccc aga aat tea gta aat ggg act ttc 169 
Met Thr Thr Pro Arg Asn Ser Val Asn Gly Thr Phe 
15 10 

ccg gca gag cca atg aaa ggc cct att get atg caa tct ggt cca aaa 217 
Pro Ala Glu Pro Met Lys Gly Pro lie Ala Met Gin Ser Gly Pro Lys 
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15 20 25 

cca etc ttc agg agg atg tct tea ctg gtg ggc ccc acg caa age ttc 265 
Pro Leu Phe Arg Arg Met Ser Ser Leu Val Gly Pro Thr Gin Ser Phe 
30 35 40 

ttc atg agg gaa tct aag act ttg ggg get gtc cag att atg aat ggg 313 
Phe Met Arg Glu Ser Lys Thr Leu Gly Ala Val Gin lie Met Asn Gly 
45 50 55 60 



etc ttc cac att gec ctg ggg ggt ctt ctg atg ate cca gca ggg ate 361 
Leu Phe His lie Ala Leu Gly Gly Leu Leu Met lie Pro Ala Gly lie 
65 70 75 



tat gca ccc ate tgt gtg act gtg tgg tac cct etc tgg gga ggc att 409 
Tyr Ala Pro lie Cys Val Thr Val Trp Tyr Pro Leu Trp Gly Gly lie 
80 85 90 

atg tat att att tec gga tea etc ttg gca gca acg gag aaa aac tct 457 
Met Tyr lie lie Ser Gly Ser Leu Leu Ala Ala Thr Glu Lys Asn Ser 
95 100 105 

agg aag tgt ttg gtc aaa gga aaa atg ata atg aat tea ttg age etc 505 
Arg Lys Cys Leu Val Lys Gly Lys Met lie Met Asn Ser Leu Ser Leu 
110 115 120 

ttt get gec att tct gga atg att ctt tea ate atg gac ata ctt aat 553 
Phe Ala Ala lie Ser Gly Met lie Leu Ser lie Met Asp lie Leu Asn 
125 130 135 140 

att aaa att tec cat ttt tta aaa atg gag agt ctg aat ttt att aga 601 
lie Lys lie Ser His Phe Leu Lys Met Glu Ser Leu Asn Phe lie Arg 
145 150 155 

get cac aca cca tat att aac ata tac aac tgt gaa cca get aat ccc 649 
Ala His Thr Pro Tyr lie Asn lie Tyr Asn Cys Glu Pro Ala Asn Pro 
160 165 170 

tct gag aaa aac tec cca tct ace caa tac tgt tac age ata caa tct 697 
Ser Glu Lys Asn Ser Pro Ser Thr Gin Tyr Cys Tyr Ser lie Gin Ser 
175 180 185 

ctg ttc ttg ggc att ttg tea gtg atg ctg ate ttt gec ttc ttc cag 745 
Leu Phe Leu Gly lie Leu Ser Val Met Leu lie Phe Ala Phe Phe Gin 
190 195 200 

gaa ctt gta ata get ggc ate gtt gag aat gaa tgg aaa aga acg tgc 793 
Glu Leu Val lie Ala Gly lie Val Glu Asn Glu Trp Lys Arg Thr Cys 
205 210 215 220 

tec aga ccc aaa tct aac ata gtt etc ctg tea gca gaa gaa aaa aaa 841 
Ser Arg Pro Lys Ser Asn lie Val Leu Leu Ser Ala Glu Glu Lys Lys 
225 230 235 

gaa cag act att gaa ata aaa gaa gaa gtg gtt ggg eta act gaa aca 889 
Glu Gin Thr lie Glu He Lys Glu Glu Val Val Gly Leu Thr Glu Thr 
240 245 250 

tct tec caa cca aag aat gaa gaa gac att gaa att att cca ate caa 937 
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Ser Ser Gin Pro Lys Asn Glu Glu Asp lie Glu lie lie Pro lie Gin 
255 260 265 

gaa gag gaa gaa gaa gaa aca gag acg aac ttt cca gaa cct ccc caa 985 
Glu Glu Glu Glu Glu Glu Thr Glu Thr Asn Phe Pro Glu Pro Pro Gin 
270 275 280 

gat cag gaa tec tea cca ata gaa aat gac age tct cct taa 1027 
Asp Gin Glu Ser Ser Pro lie Glu Asn Asp Ser Ser Pro 
285 290 295 

gtgatttctt ctgttttctg tttccttttt taaacattag tgttcatagc ttccaagaga 1087 
catgetgact ttcatttctt gaggtactct gcacatacgc accacatctc tatctggee 1146 

<210> 40 

<211> 297 

<212> PRT 

<213> Homo sapiens 



<400> 40 

Met Thr Thr Pro Arg Asn Ser Val Asn Gly Thr Phe Pro Ala Glu Pro 
1 5 10 15 



Met Lys Gly Pro lie Ala Met Gin Ser Gly Pro Lys Pro Leu Phe Arg 
20 25 30 



Arg Met Ser Ser Leu Val Gly Pro Thr Gin Ser Phe Phe Met Arg Glu 
35 40 45 



Ser Lys Thr Leu Gly Ala Val Gin lie Met Asn Gly Leu Phe His lie 
50 55 60 



Ala Leu Gly Gly Leu Leu Met lie Pro Ala Gly lie Tyr Ala Pro lie 
65 70 75 80 



Cys Val Thr Val Trp Tyr Pro Leu Trp Gly Gly lie Met Tyr lie lie 
85 90 95 



Ser Gly Ser Leu Leu Ala Ala Thr Glu Lys Asn Ser Arg Lys Cys Leu 
100 105 110 



Val Lys Gly Lys Met lie Met Asn Ser Leu Ser Leu Phe Ala Ala lie 
115 120 125 



Ser Gly Met lie Leu Ser lie Met Asp lie Leu Asn lie Lys lie Ser 
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130 135 140 



His Phe Leu Lys Met Glu Ser Leu Asn Phe lie Arg Ala His Thr Pro 
145 150 155 160 



Tyr lie Asn lie Tyr Asn Cys Glu Pro Ala Asn Pro Ser Glu Lys Asn 
165 170 175 



Ser Pro Ser Thr Gin Tyr Cys Tyr Ser lie Gin Ser Leu Phe Leu Gly 
180 185 190 



lie Leu Ser Val Met Leu lie Phe Ala Phe Phe Gin Glu Leu Val He 
195 200 205 



Ala Gly He Val Glu Asn Glu Trp Lys Arg Thr Cys Ser Arg Pro Lys 
210 215 220 



Ser Asn He Val Leu Leu Ser Ala Glu Glu Lys Lys Glu Gin Thr He 
225 230 235 240 



Glu He Lys Glu Glu Val Val Gly Leu Thr Glu Thr Ser Ser Gin Pro 
245 250 255 



Lys Asn Glu Glu Asp He Glu He He Pro He Gin Glu Glu Glu Glu 
260 265 270 



Glu Glu Thr Glu Thr Asn Phe Pro Glu Pro Pro Gin Asp Gin Glu Ser 
275 280 285 



Ser Pro He Glu Asn Asp Ser Ser Pro 
290 295 



<210> 41 

<211> 894 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDNA 

<222> (1) . . (894) 
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<220> 

<221> CDS 

<222> (51) . . (785) 

<300> 

<301> Kuster, H, Zhang, L, Brini, AT , MacGlashan, DWJ, Kinet, J-P 

<302> The gene and cDNA for the human high affinity immunoglobulin E 
receptor beta chain and expression of the complete human receptor 

<303> J Biol Cham 

<304> 267 

<305> 18 

<306> 12782-12787 

<307> 1992- - 

<309> - - 



<400> 41 

tctcaatata ataatattct ttattcctgg acagctcggt taatgaaaaa atg gac 56 

Met Asp 
1 

aca gaa agt aat agg aga gca aat ctt get etc cca cag gag cct tec 104 
Thr Glu Ser Asn Arg Arg Ala Asn Leu Ala Leu Pro Gin Glu Pro Ser 
5 10 15 

agt gtg cct gca ttt gaa gtc ttg gaa ata tct ccc cag gaa gta tct 152 
Ser Val Pro Ala Phe Glu Val Leu Glu lie Ser Pro Gin Glu Val Ser 
20 25 30 



tea ggc aga eta ttg aag teg gee tea tec cca cca ctg cat aca tgg 200 
Ser Gly Arg Leu Leu Lys Ser Ala Ser Ser Pro Pro Leu His Thr Trp 
35 40 45 50 

ctg aca gtt ttg aaa aaa gag cag gag ttc ctg ggg gta aca caa att 24 8 

Leu Thr Val Leu Lys Lys Glu Gin Glu Phe Leu Gly Val Thr Gin lie 
55 60 65 

ctg act get atg ata tgc ctt tgt ttt gga aca gtt gtc tgc tct gta 296 
Leu Thr Ala Met lie Cys Leu Cys Phe Gly Thr Val Val Cys Ser Val 
70 75 80 

ctt gat att tea cac att gag gga gac att ttt tea tea ttt aaa gca 344 
Leu Asp lie Ser His lie Glu Gly Asp lie Phe Ser Ser Phe Lys Ala 
85 90 95 

ggt tat cca ttc tgg gga gee ata ttt ttt tct att tct gga atg ttg 392 
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Gly Tyr Pro Phe Trp Gly Ala lie Phe Phe Ser lie Ser Gly Met Leu 
100 105 110 

tea att ata tct gaa agg aga aat gca aca tat ctg gtg aga gga age 440 
5 Ser lie lie Ser Glu Arg Arg Asn Ala Thr Tyr Leu Val Arg Gly Ser 

115 120 125 130 

ctg gga gca aac act gec age age ata get ggg gga acg gga att ace 488 
Leu Gly Ala Asn Thr Ala Ser Ser lie Ala Gly Gly Thr Gly lie Thr 
10 135 140 145 

ate ctg ate ate aac ctg aag aag age ttg gee tat ate cac ate cac 536 
lie Leu lie lie Asn Leu Lys Lys Ser Leu Ala Tyr lie His lie His 
150 155 160 

15 

agt tgc cag aaa ttt ttt gag acc aag tgc ttt atg get tec ttt tec 584 
Ser Cys Gin Lys Phe Phe Glu Thr Lys Cys Phe Met Ala Ser Phe Ser 
165 170 175 

20 act gaa att gta gtg atg atg ctg ttt etc acc att ctg gga ctt ggt 632 

Thr Glu lie Val Val Met Met Leu Phe Leu Thr lie Leu Gly Leu Gly 
180 185 190 

agt get gtg tea etc aca ate tgt gga get ggg gaa gaa etc aaa gga 680 
25 Ser Ala Val Ser Leu Thr lie Cys Gly Ala Gly Glu Glu Leu Lys Gly 

195 200 205 210 

aac aag gtt cca gag gat cgt gtt tat gaa gaa tta aac ata tat tea 72 8 

Asn Lys Val Pro Glu Asp Arg Val Tyr Glu Glu Leu Asn lie Tyr Ser 
30 215 220 225 

get act tac agt gag ttg gaa gac cca ggg gaa atg tct cct ccc att 776 
Ala Thr Tyr Ser Glu Leu Glu Asp Pro Gly Glu Met Ser Pro Pro lie 
230 235 240 



35 



gat tta taa gaatcacgtg tccagaacac tctgattcac agecaaggat 825 
Asp Leu 



40 ccagaaggcc aaggtcttgt taaggggcta ctggaaaaat ttctattctc tccacagcct 885 

gctggtttt 894 

45 <210> 42 

<211> 244 

<212> PRT 

<213> Homo sapiens 



50 



55 <400> . 42 

Met Asp Thr Glu Ser Asn Arg Arg Ala Asn Leu Ala Leu Pro Gin Glu 
15 10 15 

60 

Pro Ser Ser Val Pro Ala Phe Glu Val Leu Glu lie Ser Pro Gin Glu 
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20 25 30 



Val Ser Ser Gly Arg Leu Leu Lys Ser Ala Ser Ser Pro Pro Leu His 
35 40 45 



Thr Trp Leu Thr Val Leu Lys Lys Glu Gin Glu Phe Leu Gly Val Thr 
50 55 60 



Gin lie Leu Thr Ala Met lie Cys Leu Cys Phe Gly Thr Val Val Cys 
65 70 75 80 



Ser Val Leu Asp lie Ser His lie Glu Gly Asp lie Phe Ser Ser Phe 
85 90 95 



Lys Ala Gly Tyr Pro Phe Trp Gly Ala lie Phe Phe Ser lie Ser Gly 
100 105 110 



Met Leu Ser lie lie Ser Glu Arg Arg Asn Ala Thr Tyr Leu Val Arg 
115 120 125 



Gly Ser Leu Gly Ala Asn Thr Ala Ser Ser lie Ala Gly Gly Thr Gly 
13 0 13 5 14 0 



lie Thr lie Leu lie lie Asn Leu Lys Lys Ser Leu Ala Tyr lie His 
145 150 155 160 



lie His Ser Cys Gin Lys Phe Phe Glu Thr Lys Cys Phe Met Ala Ser 
165 170 175 



Phe Ser Thr Glu lie Val Val Met Met Leu Phe Leu Thr lie Leu Gly 
180 185 190 



Leu Gly Ser Ala Val Ser Leu Thr lie Cys Gly Ala Gly Glu Glu Leu 
195 200 205 



Lys Gly Asn Lys Val Pro Glu Asp Arg Val Tyr Glu Glu Leu Asn lie 
210 215 220 



Tyr Ser Ala Thr Tyr Ser Glu Leu Glu Asp Pro Gly Glu Met Ser Pro 
225 230 235 240 



Pro lie Asp Leu 



<210> 43 
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<211> 1646 

<212> DNA 

<213 > homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1646) 

<220> 

<221> CDS 

<222> (99) . . (743) 

<300> 

<301> Adra, CN, Lelias, JM, Kobayashi, H, Kaghad, M, Morrison, P, 
Rowl ey , JD , Lim , B 

<302> Cloning of the cDNA for a hematopoietic cell-specific protein 
related to CD20 and the beta subunit of the high-affinity IgE receptor: 
evidence for a family of proteins with four membrane -spanning regions 

<303> Proc Natl Acad Sci USA 

<304> 91 

<305> 21 

<306> 10178-10182 

<307> 1994- - 

<309> - - 



<400> 43 

tagtgatctt ttctgagtgt ctcctacttg cgacaaggtg gacttgggag gaaagccgtc 60 

tgccaaagcc tgaagcctcc aagccataaa caacccca atg gcc tec cac gaa gtt 116 

Met Ala Ser His Glu Val 
1 5 

gat aat gca gag ctg ggg tea gcc tct gcc cat ggt acc cca ggc agt 164 
Asp Asn Ala Glu Leu Gly Ser Ala Ser Ala His Gly Thr Pro Gly Ser 
10 15 2 0 

gag acg gga cca gaa gag ctg aat act tct gtc tac cac ccc ata aat 212 
Glu Thr Gly Pro Glu Glu Leu Asn Thr Ser Val Tyr His Pro lie Asn 
25 30 35 
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gga tea cca gat tat cag aaa gca aaa tta caa gtt ctt ggg gec ate 260 
Gly Ser Pro Asp Tyr Gin Lys Ala Lys Leu Gin Val Leu Gly Ala lie 
40 45 50 

cag ate ctg aat gca gca atg att ctg get ttg ggt gtc ttt ctg ggt 308 
Gin lie Leu Asn Ala Ala Met lie Leu Ala Leu Gly Val Phe Leu Gly 
55 60 65 70 

tec ttg caa tac cca tac cac ttc caa aag cac ttc ttt ttc ttc acc 356 
Ser Leu Gin Tyr Pro Tyr His Phe Gin Lys His Phe Phe Phe Phe Thr 
75 80 85 

ttc tac aca ggc tac ccg att tgg ggt get gtg ttt ttc tgt agt tea 404 
Phe Tyr Thr Gly Tyr Pro lie Trp Gly Ala Val Phe Phe Cys Ser Ser 
90 95 100 

gga acc ttg tct gtt gta gca ggg ata aaa ccc aca aga aca tgg ata 452 
Gly Thr Leu Ser Val Val Ala Gly lie Lys Pro Thr Arg Thr Trp lie 
105 110 115 

cag aac agt ttt gga atg aac att gee agt get aca att gca eta gtg 500 
Gin Asn Ser Phe Gly Met Asn lie Ala Ser Ala Thr lie Ala Leu Val 
120 125 130 

ggg act get ttt etc tea eta aat ata gca gtt aat ate cag tea tta 548 
Gly Thr Ala Phe Leu Ser Leu Asn lie Ala Val Asn lie Gin Ser Leu 
135 140 145 150 

agg a 9t tgt cac tct tea tea gag tea ccg gac eta tgc aat tac atg 596 
Arg Ser Cys His Ser Ser Ser Glu Ser Pro Asp Leu Cys Asn Tyr Met 
155 160 165 

ggc tec ata tea aat ggc atg gtg tct eta ctg ctg att etc acc ttg 644 
Gly Ser lie Ser Asn Gly Met Val Ser Leu Leu Leu lie Leu Thr Leu 
170 175 180 

ctg gaa tta tgc gta act ate tct acc ata gec atg tgg tgc aat gca 692 
Leu Glu Leu Cys Val Thr lie Ser Thr lie Ala Met Trp Cys Asn Ala 
185 190 195 

aac tgc tgt aat tea aga gag gaa att tec tea cct ccc aat tct gtg 740 
Asn Cys Cys Asn Ser Arg Glu Glu lie Ser Ser Pro Pro Asn Ser Val 
200 205 210 

taa tcaagaatac ctccttatga aaataattct gagagcatga atatttgacc 793 

ttaaatctcc agtgactcag agcttcaccc acaaactcag gagaacataa gcctgctcgt 853 

aaagctcaat ccttctatca tggcaccaat cacaagaacc ttggacgttt gactgactct 913 

atcctttctc tcctaactat aaatcctatt tgtgtgtcgt gggtatggaa ggacagatat 973 

atttctttag gcattcttgg atatctgtaa cttctatgat cattactcca aagttgtttc 1033 

cagaaattgg ttctatttct tcttatccac ctactccatt gctttatgag gtttaaggaa 1093 

ggaaggcggt ataatcccta ttcaatatat tttttctaaa atccaacttc tgaccgccca 1153 

gtaggaagaa aaatgagaca ttttttccat tacagagaaa tgcttcttga ctttaacatc 1213 
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agcattataa 


aaagtgtcaa 


ataaaaaatt 


accatcatta 


tcattaaaat 


aaattttcac 


1273 


tgtatttgag 


atgggagggt 


taaggctcag 


ggattttatt 


tcagtgaact 


gctggaactc 


1333 


acacatgccc 


tgatatgtaa 


atgatgattt 


atgttggcga 


gtctgagagc 


aagcccaaat 


1393 


gtgttcttca 


aaggacaatg 


ggaaactgta 


aagtagagaa 


ctaaagaata 


aggcctttag 


1453 


aatctgacac 


atctgggttc 


aaattctgaa 


actgtcactt 


attacctgta 


tgaacatggg 


1513 


caaattatct 


aatctctctg 


atctattttt 


cctcatctgt 


aaaataggtg 


taataataac 


1573 


aactactttg 


tcggttgctc 


tgagggttaa 


atgaaaataa 


aaagaaaatg 


tgaaacagca 


1633 


ccacaggtac 


ttg 










1646 



<210> 44 

<211> 214 

<212> PRT 

<213> homo sapiens 

<400> 44 

Met Ala Ser His Glu Val Asp Asn Ala Glu Leu Gly Ser Ala Ser Ala 
15 10 15 



His Gly Thr Pro Gly Ser Glu Thr Gly Pro Glu Glu Leu Asn Thr Ser 
20 25 30 



Val Tyr His Pro lie Asn Gly Ser Pro Asp Tyr Gin Lys Ala Lys Leu 
35 40 45 



Gin Val Leu Gly Ala lie Gin lie Leu Asn Ala Ala Met lie Leu Ala 
50 55 60 



Leu Gly Val Phe Leu Gly Ser Leu Gin Tyr Pro Tyr His Phe Gin Lys 
65 70 75 80 



His Phe Phe Phe Phe Thr Phe Tyr Thr Gly Tyr Pro lie Trp Gly Ala 
85 90 95 



Val Phe Phe Cys Ser Ser Gly Thr Leu Ser Val Val Ala Gly lie Lys 
100 105 110 



Pro Thr Arg Thr Trp lie Gin Asn Ser Phe Gly Met Asn lie Ala Ser 
115 120 125 
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Ala Thr lie Ala Leu Val Gly Thr Ala Phe Leu Ser Leu Asn lie Ala 
130 135 140 



Val Asn lie Gin Ser Leu Arg Ser Cys His Ser Ser Ser Glu Ser Pro 
145 150 155 160 



Asp Leu Cys Asn Tyr Met Gly Ser lie Ser Asn Gly Met Val Ser Leu 
165 170 175 



Leu Leu lie Leu Thr Leu Leu Glu Leu Cys Val Thr lie Ser Thr lie 
180 185 190 



Ala Met Trp Cys Asn Ala Asn Cys Cys Asn Ser Arg Glu Glu lie Ser 
195 ^ 200 205 



Ser Pro Pro Asn Ser Val 
210 



<210> 45 

<211> 1182 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1182) 

<220> 

<221> CDS 

<222> (58) . . (861) 



<400> 45 

acacaggttg gagcagagaa agaggaaaca tagaggtgcc aaaggaacaa agacata 57 

atg atg tea tec aag cca aca age cat get gaa gta aat gaa ace ata 105 
Met Met Ser Ser Lys Pro Thr Ser His Ala Glu Val Asn Glu Thr lie 
15 10 15 

ccc aac cct tac cca cca ggc age ttt atg get cct gga ttt caa cag 153 
Pro Asn Pro Tyr Pro Pro Gly Ser Phe Met Ala Pro Gly Phe Gin Gin 
20 25 30 



-65- 



WO 02/062946 



PCT/US01/48437 



cct ctg ggt tea ate aac tta gaa aac caa get cag ggt get cag cgt 201 
Pro Leu Gly Ser lie Asn Leu Glu Asn Gin Ala Gin Gly Ala Gin Arg 
35 40 45 

get cag ccc tac ggc ate aca tct ccg gga ate ttt get age agt caa 249 
Ala Gin Pro Tyr Gly lie Thr Ser Pro Gly lie Phe Ala Ser Ser Gin 
50 55 60 

cc 9 99t caa 99 a aat ata caa at 9 ata aat cca a 9 t 9 fc 9 99 a aca 9 ca 297 
Pro Gly Gin Gly Asn He Gin Met He Asn Pro Ser Val Gly Thr Ala 
65 70 75 80 

gta atg aac ttt aaa gaa gaa gca aag gca eta ggg gtg ate cag ate 345 
Val Met Asn Phe Lys Glu Glu Ala Lys Ala Leu Gly Val He Gin He 
85 90 95 

atg gtt gga ttg atg cac att ggt ttt gga att gtt ttg tgt tta ata 393 
Met Val Gly Leu Met His He Gly Phe Gly He Val Leu Cys Leu He 
100 105 110 

tec ttc tct ttt aga gaa gta tta ggt ttt gec tct act get gtt att 441 
Ser Phe Ser Phe Arg Glu Val Leu Gly Phe Ala Ser Thr Ala Val He 
115 120 125 

ggt gga tac cca ttc tgg ggt ggc ctt tct ttt att ate tct ggc tct 489 
Gly Gly Tyr Pro Phe Trp Gly Gly Leu Ser Phe He He Ser Gly Ser 
130 135 140 



etc tct gtg 
Leu Ser Val 
145 

age ctg gga 
Ser Leu Gly 



att ctg ctg 
He Leu Leu 



tac tgg gee 
Tyr Trp Ala 
195 

ttc tec etc 
Phe Ser Leu 
210 

aac caa gca 
Asn Gin Ala 
225 

atg tat gaa 
Met Tyr Glu 



aga tgc aac 
Arg Cys Asn 



tea gca tec 
Ser Ala Ser 
15 0 

atg aac att 
Met Asn lie 
165 

ctg gtg gat 
Leu Val Asp 
180 

gtg ctt tct 
Val Leu Ser 



ttg gag ttc 
Leu Glu Phe 



aac acc aca 
Asn Thr Thr 
230 

age aac cct 
Ser Asn Pro 
245 

aac tac tea 
Asn Tyr Ser 
260 



aag gag ctt 
Lys Glu Leu 



gtt agt tct 
Val Ser Ser 



atg tgc ate 
Met Cys He 
185 

gga aaa ggc 
Gly Lys Gly 
200 

ttc gta get 
Phe Val Ala 
215 

acc aat atg 
Thr Asn Met 



gtg aca cca 
Val Thr Pro 



get aat gee 
Ala Asn Ala 
265 



tec cgt tgt 
Ser Arg Cys 
155 

ate ttg gec 
He Leu Ala 
170 

aat ggg gta 
Asn Gly Val 



att tea gec 
He Ser Ala 



tgt gee aca 
Cys Ala Thr 
220 

tct gtc ctg 
Ser Val Leu 
235 

gcg tct tct 
Ala Ser Ser 
250 

cct aaa tag 
Pro Lys 



ctg gtg aaa 
Leu Val Lys 



ttc att gga 
Phe He Gly 
175 

get ggc caa 
Ala Gly Gin 
190 

acg ctg atg 
Thr Leu Met 
205 

gee cat ttt 
Ala His Phe 



gtt att cca 
Val He Pro 



tea get cct 
Ser Ala Pro 
255 

taaaagaaaa 



ggc 53 7 

Gly 

160 

gtg 585 
Val 



gac 633 
Asp 



ate 681 
He 



gec 729 
Ala 



aat 777 

Asn 

240 

CCC 825 
Pro 



871 
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aggggtatca 


gtctaatctc 


atggagaaaa 


actacttgca 


aaaacttctt 


aagaagatgt 


93 1 


cttttattgt 


ctacaatgat 


ttctagtctt 


taaaaactgt 


gtttgagatt 


tgt ttttagg 


991 


ttggtcgcta 


atgatggctg 


tatctccctt 


cac tgtctct 


tcctacatta 


ccact actac 


1051 


atgctggcaa 


aggtgaagga 


tcagaggact 


gaaaaatgat 


tctgcaactc 


tcttaaagtt 


1111 


agaaatgttt 


ctgttcatat 


tactttttcc 


ttaataaaat 


gtcattagaa 


acaaaaaaaa 


1171 


aaaaaaaaaa 


a 










1182 



<210> 46 

<211> 267 

<212> PRT 

<213> Homo sapiens 

<400> 46 

Met Met Ser Ser Lys Pro Thr Ser His Ala Glu Val Asn Glu Thr lie 
15 10 15 



Pro Asn Pro Tyr Pro Pro Gly Ser Phe Met Ala Pro Gly Phe Gin Gin 
20 25 30 



Pro Leu Gly Ser lie Asn Leu Glu Asn Gin Ala Gin Gly Ala Gin Arg 
35 40 45 



Ala Gin Pro Tyr Gly lie Thr Ser Pro Gly lie Phe Ala Ser Ser Gin 
50 55 60 



Pro Gly Gin Gly Asn He Gin Met He Asn Pro Ser Val Gly Thr Ala 
65 70 75 80 



Val Met Asn Phe Lys Glu Glu Ala Lys Ala Leu Gly Val He Gin He 
85 90 95 



Met Val Gly Leu Met His He Gly Phe Gly He Val Leu Cys Leu He 
100 105 110 



Ser Phe Ser Phe Arg Glu Val Leu Gly Phe Ala Ser Thr Ala Val He 
115 120 125 



Gly Gly Tyr Pro Phe Trp Gly Gly Leu Ser Phe He He Ser Gly Ser 
130 135 140 
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Leu Ser Val Ser Ala Ser Lys Glu Leu Ser Arg Cys Leu Val Lys Gly 
145 150 155 160 



Ser Leu Gly Met Asn lie Val Ser Ser lie Leu Ala Phe lie Gly Val 
165 170 175 



lie Leu Leu Leu Val Asp Met Cys lie Asn Gly Val Ala Gly Gin Asp 
180 185 190 



Tyr Trp Ala Val Leu Ser Gly Lys Gly lie Ser Ala Thr Leu Met lie 
195 200 205 



Phe Ser Leu Leu Glu Phe Phe Val Ala Cys Ala Thr Ala His Phe Ala 
210 215 220 



Asn Gin Ala Asn Thr Thr Thr Asn Met Ser Val Leu Val lie Pro Asn 
225 230 235 240 



Met Tyr Glu Ser Asn Pro Val Thr Pro Ala Ser Ser Ser Ala Pro Pro 
245 250 255 



Arg Cys Asn Asn Tyr Ser Ala Asn Ala Pro Lys 
260 265 



<210> 47 

<211> 1388 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDNA 

<222> (1) . . (1388) 

<220> 

<221> CDS 

<222> (178) . . (1053) 
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<300> 

<301> Tedder, TF , Klejman, G, Disteche, CM, Adler, DA, Schlossman, SF, 
Saito, H 

<302> Cloning of complementary DNA encoding a new mouse B lymphocyte 
differentiation antigen, homologous to the human Bl (CD20) antigen and 
localization of the gen to chromosome 19 

<303> J Immunol 

<304> 141 

<305> 12 

<306> 4388-4394 

<307> 1988- - 

<309> - - 



<400> 47 

gaattccttt tttttttttt tttttaaaga tttatttatt attatatgta agtacactgt 60 

agctatcttc aagtacttga gatagaagag gccaactgat ctcagctgtg agtggctaat 120 

ttggccctta agccttggag ccttggagcc ttggagaccc aggcgtttga aaactca 177 

atg agt gga cct ttc cca gca gag cct aca aaa ggt ccc etc gec atg 225 

Met Ser Gly Pro Phe Pro Ala Glu Pro Thr Lys Gly Pro Leu Ala Met 
15 10 15 



caa cct get cca aaa gtg aac etc aaa agg aca tct tea ctg gtg ggc 
Gin Pro Ala Pro Lys Val Asn Leu Lys Arg Thr Ser Ser Leu Val Gly 
20 25 30 



273 



ccc aca caa age ttc ttc atg agg gaa tea aag get ttg ggg get gtc 
Pro Thr Gin Ser Phe Phe Met Arg Glu Ser Lys Ala Leu Gly Ala Val 
35 40 45 



321 



caa ate atg aat ggc etc ttc cat att acc ctg ggg gga ctg ctg atg 
Gin lie Met Asn Gly Leu Phe His lie Thr Leu Gly Gly Leu Leu Met 
50 55 60 



369 



ate ccc aca ggg gtc ttc gca ccc ate tgt ttg agt gta tgg tac cct 
lie Pro Thr Gly Val Phe Ala Pro lie Cys Leu Ser Val Trp Tyr Pro 
65 70 75 80 



417 



etc tgg gga ggc att atg tac att att tea gga tea etc ctg gca get 
Leu Trp Gly Gly lie Met Tyr lie lie Ser Gly Ser Leu Leu Ala Ala 
85 90 95 



465 



gca gca gaa aaa acc tec agg aag agt ttg gtc aaa gca aaa gtg ata 
Ala Ala Glu Lys Thr Ser Arg Lys Ser Leu Val Lys Ala Lys Val lie 
100 105 110 



513 



atg age tct eta age etc ttt get gee att tct gga ata att ctt tea 
Met Ser Ser Leu Ser Leu Phe Ala Ala lie Ser Gly lie lie Leu Ser 
115 120 125 



561 
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ate atg gac ata ctt aac atg aca ctt tct cat ttt tta aaa atg aga 609 
lie Met Asp lie Leu Asn Met Thr Leu Ser His Phe Leu Lys Met Arg 
130 135 140 

aga ctg gag ctt att caa act tec aag ccg tat gtt gat ate tac gac 657 
Arg Leu Glu Leu lie Gin Thr Ser Lys Pro Tyr Val Asp lie Tyr Asp 
145 150 155 160 

tgt gaa cca tct aat tec tea gag aaa aac tec cca tct aca cag tac 705 
Cys Glu Pro Ser Asn Ser Ser Glu Lys Asn Ser Pro Ser Thr Gin Tyr 
165 170 175 

tgt aac age att cag tct gtg ttc ttg ggc att ctg teg gcg atg ctg 753 
Cys Asn Ser lie Gin Ser Val Phe Leu Gly lie Leu Ser Ala Met Leu 
180 185 190 

ate tct gec ttc ttc cag aaa ctt gtg aca get ggt att gtg gag aat 801 
lie Ser Ala Phe Phe Gin Lys Leu Val Thr Ala Gly lie Val Glu Asn 
195 200 205 

9 a 9 tgg aaa aga atg tgt acc aga tec aaa tct aat gtg gtt ctg ctg 84 9 

Glu Trp Lys Arg Met Cys Thr Arg Ser Lys Ser Asn Val Val Leu Leu 
210 ~ 215 220 

tea get gga gaa aaa aat gag cag acg att aaa atg aaa gaa gaa ate 897 
Ser Ala Gly Glu Lys Asn Glu Gin Thr lie Lys Met Lys Glu Glu lie 
225 230 235 240 

att gag eta agt gga gta tct tec caa cca aag aat gaa gag gaa att 945 
lie Glu Leu Ser Gly Val Ser Ser Gin Pro Lys Asn Glu Glu Glu lie 
245 250 255 

gaa att att cca gtg cag gag gaa gaa gaa gaa gaa gca gaa ata aat 993 
Glu He He Pro Val Gin Glu Glu Glu Glu Glu Glu Ala Glu He Asn 
260 265 270 

ttt cca gca cct ccc caa gag cag gaa tec ttg cca gtg gaa aat gag 1041 
Phe Pro Ala Pro Pro Gin Glu Gin Glu Ser Leu Pro Val Glu Asn Glu 
275 280 285 

ate get cct taa actcttttct tttctaagca ttattgttta gagagcttcc 1093 
He Ala Pro 
290 

aagacacata gttaccctca tctcttgtgg ccttccacaa tctattctcc atattttcac 1153 

agcttaactt tgcatagaga agccacatct agctctcctt cacatttgaa gaatgcagtg 1213 

attataaaag attgtctttt gecttgetta gggagtctta cactggcaga aacgetgaag 1273 

aatccaattc tcattcacct tttccttgga tgtgtgtctc agtagtggta atggtttttc 1333 

cgcatttcct ccatcagcag ttacagcaga aggagtcaga gagttcaggg aattc 1388 

<210> 48 
<211> 291 
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<212> PRT 

<213> Mus musculus 

<400> 48 

Met Ser Gly Pro Phe Pro Ala Glu Pro Thr Lys Gly Pro Leu Ala Met 
15 10 15 



Gin Pro Ala Pro Lys Val Asn Leu Lys Arg Thr Ser Ser Leu Val Gly 
20 25 30 



Pro Thr Gin Ser Phe Phe Met Arg Glu Ser Lys Ala Leu Gly Ala Val 
35 40 45 



Gin lie Met Asn Gly Leu Phe His lie Thr Leu Gly Gly Leu Leu Met 
50 55 60 



lie Pro Thr Gly Val Phe Ala Pro lie Cys Leu Ser Val Trp Tyr Pro 
65 70 75 80 



Leu Trp Gly Gly lie Met Tyr lie lie Ser Gly Ser Leu Leu Ala Ala 
85 90 95 



Ala Ala Glu Lys Thr Ser Arg Lys Ser Leu Val Lys Ala Lys Val lie 
100 105 110 



Met Ser Ser Leu Ser Leu Phe Ala Ala lie Ser Gly lie lie Leu Ser 
115 120 125 



lie Met Asp lie Leu Asn Met Thr Leu Ser His Phe Leu Lys Met Arg 
130 135 140 



Arg Leu Glu Leu lie Gin Thr Ser Lys Pro Tyr Val Asp lie Tyr Asp 
145 150 155 160 



Cys Glu Pro Ser Asn Ser Ser Glu Lys Asn Ser Pro Ser Thr Gin Tyr 
165 170 175 



Cys Asn Ser lie Gin Ser Val Phe Leu Gly lie Leu Ser Ala Met Leu 
180 185 190 



lie Ser Ala Phe Phe Gin Lys Leu Val Thr Ala Gly lie Val Glu Asn 
195 200 205 



Glu Trp Lys Arg Met Cys Thr Arg Ser Lys Ser Asn Val Val Leu Leu 
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210 215 220 



Ser Ala Gly Glu Lys Asn Glu Gin Thr lie Lys Met Lys Glu Glu lie 
225 230 235 240 



lie Glu Leu Ser Gly Val Ser Ser Gin Pro Lys Asn Glu Glu Glu lie 
245 250 255 



Glu lie lie Pro Val Gin Glu Glu Glu Glu Glu Glu Ala Glu lie Asn 
260 265 270 



Phe Pro Ala Pro Pro Gin Glu Gin Glu Ser Leu Pro Val Glu Asn Glu 
275 280 285 



lie Ala Pro 
290 



<210> 49 

<211> 1010 

<212> DNA 

<213> Mus musculus 

<220> 

<221> cDNA 

<222> (1) . . (1010) 

<220> 

<221> CDS 

<222> (23) . . (730) 

<300> 

<301> Ra, C, Jouvin, M-He, Kinet, J-P 

<302> Complete structure of the mouse mast cell receptor for IgE 
(FceRI) and surface expression of chimeric receptors (rat -mouse -human) 
on transfected cells 

<303> J Biol Chem 

<304> 264 
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<305> 26 

<306> 15323-15327 

<307> 1989- - 

<309> - - 



<400> 49 

ggatagccca attaatgaaa aa atg gac aca gaa aat agg age aga gca gat 52 

Met Asp Thr Glu Asn Arg Ser Arg Ala Asp 
15 10 

ctt get etc cca aat cca caa gaa tec tec agt gca cct gac att gaa 100 
Leu Ala Leu Pro Asn Pro Gin Glu Ser Ser Ser Ala Pro Asp lie Glu 
15 20 25 

etc ttg gaa gca tct cct gec aaa gca gec cca cca aag cag aca tgg 148 
Leu Leu Glu Ala Ser Pro Ala Lys Ala Ala Pro Pro Lys Gin Thr Trp 
30 35 40 

egg aca ttt ttg aag aaa gag ttg gag ttc ctg gga gca aca caa att 196 
Arg Thr Phe Leu Lys Lys Glu Leu Glu Phe Leu Gly Ala Thr Gin lie 
45 50 55 

ct 9 9ftt ggt ttg ata tgc ctt tgt ttt gga aca att gtc tgc tec gta 244 
Leu Val Gly Leu lie Cys Leu Cys Phe Gly Thr lie Val Cys Ser Val 
60 65 70 

etc tat gtt tea gac ttt gat gaa gaa gtg ctt tta ctt tat aaa eta 292 
Leu Tyr Val Ser Asp Phe Asp Glu Glu Val Leu Leu Leu Tyr Lys Leu 
75 80 85 90 

ggc tat cca ttc tgg ggt gca gtg ctg ttt gtt ttg tct gga ttt ttg 340 
Gly Tyr Pro Phe Trp Gly Ala Val Leu Phe Val Leu Ser Gly Phe Leu 
95 100 105 

tea att ate tec gaa aga aaa aac aca ttg tat ctg gtg aga ggc age 388 
Ser lie lie Ser Glu Arg Lys Asn Thr Leu Tyr Leu Val Arg Gly Ser 
110 115 120 

ct g gga gca aac att gtc agt age ate get gca ggg acg ggg ate gee 43 6 

Leu Gly Ala Asn lie Val Ser Ser lie Ala Ala Gly Thr Gly lie Ala 
125 130 135 

atg ctg ate etc aat ctg ace aat aac ttc get tat atg aac aac tgc 484 
Met Leu lie Leu Asn Leu Thr Asn Asn Phe Ala Tyr Met Asn Asn Cys 
140 145 150 

aag aat gta ace gaa gac gac ggc tgc ttt gtg get tct ttc acc aca 532 
Lys Asn Val Thr Glu Asp Asp Gly Cys Phe Val Ala Ser Phe Thr Thr 
155 160 165 170 

gaa ctg gtg ttg atg atg ctg ttt etc acc ate ctg gee ttt tgc agt 580 
Glu Leu Val Leu Met Met Leu Phe Leu Thr lie Leu Ala Phe Cys Ser 
175 180 185 

get gtg ttg ttc act ate tat agg att gga caa gag tta gaa agt aaa 62 8 

Ala Val Leu Phe Thr lie Tyr Arg lie Gly Gin Glu Leu Glu Ser Lys 
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190 195 200 

aag gtc cca gat gat cgt ctt tat gaa gaa tta aat gtg tat tea cca 676 
Lys Val Pro Asp Asp Arg Leu Tyr Glu Glu Leu Asn Val Tyr Ser Pro 
205 210 215 

att tac agt gag ttg gaa gac aaa ggg gaa aca tct tct cca gtt gat 724 
lie Tyr Ser Glu Leu Glu Asp Lys Gly Glu Thr Ser Ser Pro Val Asp 
220 225 230 

tea taa gaatcagggg accaggacaa tctgattcaa gtataatctt gaaagttgat 780 

Ser 

235 

ctttttacaa aattctcgea aaatttctgt ttgttccaca ttctgtcagt ttttcaattg 840 

gattgttctg cagatgccac tcttttagtt atgctgtatc tgatcttcta aatatctccc 900 

tttttgcgga tatcattcac tccaattttc ttgttttgtg tcacaatttc acatacatct 960 

tttctggaaa gtcatcaagg aataagttgg ctttattgta tgtctacttt 1010 

<210> 50 

<211> 235 

<212> PRT 

<213> Mus musculus 



<400> 50 

Met Asp Thr Glu Asn Arg Ser Arg Ala Asp Leu Ala Leu Pro Asn Pro 
1 5 10 15 



Gin Glu Ser Ser Ser Ala Pro Asp lie Glu Leu Leu Glu Ala Ser Pro 
20 25 30 



Ala Lys Ala Ala Pro Pro Lys Gin Thr Trp Arg Thr Phe Leu Lys Lys 
35 40 45 



Glu Leu Glu Phe Leu Gly Ala Thr Gin lie Leu Val Gly Leu lie Cys 
50 55 60 



Leu Cys Phe Gly Thr lie Val Cys Ser Val Leu Tyr Val Ser Asp Phe 
65 70 75 80 



Asp Glu Glu Val Leu Leu Leu Tyr Lys Leu Gly Tyr Pro Phe Trp Gly 
85 90 95 



Ala Val Leu Phe Val Leu Ser Gly Phe Leu Ser lie lie Ser Glu Arg 
100 105 110 
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Lys Asn Thr Leu Tyr Leu Val Arg Gly Ser Leu Gly Ala Asn lie Val 
115 120 125 



Ser Ser lie Ala Ala Gly Thr Gly lie Ala Met Leu lie Leu Asn Leu 
130 135 140 



Thr Asn Asn Phe Ala Tyr Met Asn Asn Cys Lys Asn Val Thr Glu Asp 
145 150 155 160 



Asp Gly Cys Phe Val Ala Ser Phe Thr Thr Glu Leu Val Leu Met Met 
165 170 175 



Leu Phe Leu Thr lie Leu Ala Phe Cys Ser Ala Val Leu Phe Thr lie 
180 185 190 



Tyr Arg lie Gly Gin Glu Leu Glu Ser Lys Lys Val Pro Asp Asp Arg 
195 200 205 



Leu Tyr Glu Glu Leu Asn Val Tyr Ser Pro lie Tyr Ser Glu Leu Glu 
210 215 220 



Asp Lys Gly Glu Thr Ser Ser Pro Val Asp Ser 
225 230 235 



<210> 51 

<211> 2542 

<212> DNA 

<213> rat 

<220> 

<221> CDNA 

<222> (1) . . (2542) 

<400> 51 

acgtttctgt gtaacaatat cttttattcc tggatagtcc aattaatgaa aaaaatggac 60 

acagaaaata agagcagagc agatcttgct ctcccaaacc cacaagaatc ccccagcgca 120 

cctgacattg aactcttgga agcgtcccct cctgcaaaag ctctaccaga gaagccagcc 180 

tcacccccac cacagcagac atggcagtca tttttgaaga aagagttgga gttcctgggc 240 
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gtaacccaag 


ttctggttgg 


tttgatatgc 


5 


ctccagactt 


cagactttga 


cgacgaagtg 


tggggtgcag 


tgctgtttgt 


tttgtctgga 




acactgtatc 


tggtgagagg 


cagcctggga 


10 


1~ t~acrcicra t ccr 


ccatattaat 


t ctcaatctg 




aaggatataa 


ccgaagacga 


tggttgcttc 


15 


atgttgctgt 


ttctcaccat 


cctggccttt 




attggacaag 


aatttgagcg 


tagtaaggtc 




gtgtattcac 


caatttacag 


tgcgttggaa 


20 


t cataagaat 


caaggggcca 


ggacaatctg 




ttgcaacatt 


atggcaacat 


ttctgtttcc 


25 


gttctgtaga 


tacccctgtt 


tcagttatga 




gtggatatcg 


ttcaatccag 


ttttcttgtt 




ggaaagtcat 


caaaaacaag 


ctagctttta 


30 


aaaaaaatta 

^-j W W- K~ 


t tttgagagt 


t taactaaac 




ttcattttag 


aattctcaac 


aatacttgtg 


35 


tatatatatt 


taatatggta 


tttacttttc 




aatgggtttt 


tcagagacat 


aaagctttat 




atattcacta 


atacagttgt 


tgctcagtgg 


40 


gagaacatta 


t. taaatcatt 


cacrtcrtacrtQ 




agtgtgtata 


tcttttggta 


agacatgtgt 


45 


agctggaaag 


tagtcaatgg 


gaatgacaag 




taactaccaa 


caattggtag 


agattgccat 




gcctctgtta 


cacagtgagt 


tagaggccag 


50 


acgaaagaag 


atgttggttc 


aaactgttaa 




tttgactgat 


atattgagtg 


cttgagtttt 


55 


g a gggg a g ca 


gtgatgcaga 


agtctggatg 




tgaaacgtat 


ttccagactt 


ttgttctgag 




agaaggtcaa 


gacaccaaaa 


gaattatttt 


60 


tgcctatact 


tagaaggtga 


agtaaggatc 



ctttgttttg 


gaacagttgt 


ctgctccaca 


300 


cttttattat 


atagagcagg 


ctacccattc 


360 


tttttgtcaa 


ttatgtccga 


aaggaaaaac 


420 


gcaaacattg 


tcagcagcat 


cgctgcaggc 


480 


agcaacaact 


ccgcttatat 


gaactactgc 


540 


gtgacttctt 


tcatcacaga 


actggtgttg 


600 


tgcagtgccg 


tgctgctcat 


tatctatagg 


660 


cccgatgacc 


gtctctatga 


agaattacat 


720 


gacacaaggg 


aagcgtccgc 


accagtggtt 


780 


attccagtat 


agtcttgaga 


gtcgatcttt 


840 


tccgcactct 


atcaactttt 


caattggatt 


900 


tgcctctggt 


ctttaattat 


ctcccttttt 


960 


ttgtgtcaca 


gtctcacata 


caacctttct 


1020 


ttgcatgtct 


actttcatga 


acaaaaggaa 


1080 


ttagataatc 


aggtaatatt 


tgactcttag 


1140 


catgatatat 


gcccaccata 


tcaagccttc 


1200 


tatgtagata 


gattttccac 


cctcaataat 


1260 


gaaaagacac 


atattatcta 


attcatgggt 


1320 


tgtttactac 


ttggtgggta 


gtaggtaata 


1380 


agatgcatag 


gtaaaatcag 


ggacactgtg 


1440 


gaaaatgaag 


aataaactga 


tgaagacttg 


1500 


aaatgattgt 


gtataacact 


tgtagataaa 


1560 


gtatgcctaa 


aatctcccag 


cccaaggcca 


1620 


tctgggctac 


acaagatcat 


acatcaaagg 


1680 


cacagtaagg 


gatatttaaa 


caaacagaag 


1740 


taataaaact 


gaatgaataa 


cattgcgggg 


1800 


atggaggagt 


agcagaatca 


gatgaaacat 


1860 


atggttataa 


gagcaatcac 


cattaaatga 


1920 


gagatagaat 


taagacagtc 


aaaatccaca 


1980 


aaaagtagaa 


agcctaacga 


ttagttggaa 


2040 
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aagcatatta cgttaggcag cagatgtcta tagtggagaa aagttaaaca aggagaaata 2100 

atgaaccacc agagactcta catgttggtt tgggaaataa gagaaaatag caattctaaa 2160 

cgaatgcaaa ctctgaagaa gcatttccca aagggtgtgg gcagaggacc agaacatttg 222 0 

caaatgtacc tagagagcaa acctgaatag gaggtaaaat gggggaaaag cagctaagaa 2280 

aatgattttg ttgctgttat ttagatttta aaagaaacaa aaagagtcat taaaaatctg 2340 

tttgctggga tcagttattg tgttctctgt gtatgtccaa agtacaggta acttttctaa 2400 

atcttcctgt aaggctcacc tcatatgtct cttcacatag ccacaccctt gattcacagt 2460 

tactctacca cagtagtaaa ctgtgcttgt ggtctcctta tgtatcttcc actagtgttt 2520 

ataaaataaa tcagaattat tt 2542 

<210> 52 

<211> 243 

<212> PRT 

<213> rat 

<400> 52 

Met Asp Thr Glu Asn Lys Ser Arg Ala Asp Leu Ala Leu Pro Asn Pro 
1 5 10 15 

Gin Glu Ser Pro Ser Ala Pro Asp lie Glu Leu Leu Glu Ala Ser Pro 
20 25 30 

Pro Ala Lys Ala Leu Pro Glu Lys Pro Ala Ser Pro Pro Pro Gin Gin 
35 40 45 

Thr Trp Gin Ser Phe Leu Lys Lys Glu Leu Glu Phe Leu Gly Val Thr 
50 55 60 

Gin Val Leu Val Gly Leu lie Cys Leu Cys Phe Gly Thr Val Val Cys 
65 70 75 80 

Ser Thr Leu Gin Thr Ser Asp Phe Asp Asp Glu Val Leu Leu Leu Tyr 
85 90 95 

Arg Ala Gly Tyr Pro Phe Trp Gly Ala Val Leu Phe Val Leu Ser Gly 
100 105 110 

Phe Leu Ser lie Met Ser Glu Arg Lys Asn Thr Leu Tyr Leu Val Arg 
115 120 125 

Gly Ser Leu Gly Ala Asn lie Val Ser Ser lie Ala Ala Gly Leu Gly 
130 135 140 

lie Ala lie Leu lie Leu Asn Leu Ser Asn Asn Ser Ala Tyr Met Asn 
145 150 155 160 



-77- 



WO 02/062946 



PCT/US01/48437 



Tyr Cys Lys Asp lie Thr Glu Asp Asp Gly Cys Phe Val Thr Ser Phe 
165 170 175 

lie Thr Glu Leu Val Leu Met Leu Leu Phe Leu Thr He Leu Ala Phe 
180 185 190 

Cys Ser Ala Val Leu Leu He He Tyr Arg He Gly Gin Glu Phe Glu 
195 200 205 

Arg Ser Lys Val Pro Asp Asp Arg Leu Tyr Glu Glu Leu His Val Tyr 
210 215 220 

Ser Pro He Tyr Ser Ala Leu Glu Asp Thr Arg Glu Ala Ser Ala Pro 
225 230 235 240 

Val Val Ser 

<210> 53 

<211> 1323 

<212> DNA 

<213> dog 

<220> 

<221> cDNA 

<222> (1) . . (1323) 



<400> 53 

cccagctcac tgcaccaacg tccttccctg gccgacagct cggctaatga gaaaatggag 60 

ccagaaaaca gtggcagggc aggccttgct ctcccgaacc cgcagggacc ctccagtgca 120 

cctgaaattg aactttcaga agtacctctc cgtgaaaacc ccttactgga gaaggctggt 180 

tcatccccac cgtgccatac gtggctgatg tttctgaaga gggagctgga atttctgggg 240 

gtaacacaaa ttctgattgc cttggtatgt ctttgttttg gaataattgt ctgctccgtg 300 

atcaatattt cagaatttaa ggaagacatt ttttcatcat ttaaagcagg ctacccattc 360 

tggggagcag tgttttttgc tatttctgga tttttgccaa ttatgtctga aaagaaacat 420 

gctacatatc tggcgtgggg cagcctggga gccaacactg tccagatagc tgcgggaata 4 80 

ggaatcttta tctacggtca acctgcagac ctgcagagga gctcggctta catccacaac 540 

tgccagcaag ctcccagaga cgacttctgc tttgtggctt gtttttccac agaaattgtg 600 

gcaatgatcc tgtttctcac tattctgggg ttttgcagtg ctgtgtcact gacgatctat 660 

ggagttggag aattagtcca agggaacaag attgtagaag atcgtcttta tgaagaatta 72 0 
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aacatatatt caccgattta cagtgagctg gaagacaaag cggaggaatc ttctcctgtt 780 

gattcataag aattgtgttc agaacattct cacagaaaag ggtctagaat aagccaagat 84 0 

ctttgttaaa ggggctacag gcaaaatttt gattcttgtc tatgatctgc tagttttaca 900 

ctaggattta ttctccaaat ggtatatgct gcctgttgtt tctgttcacc taaaattccc 960 

ctcccaccac ccacttatgg gtatgtgata gactcctatt ttcttgtttc tattgctctc 102 0 

tggtagcact ttatggaagt cacgtgtaat gagcaagcat ttattctgtg cctataactg 1080 

tacaaataga gggaacaagt ggaaggggtt gagagctgag ttaaactttc tgttttggaa 114 0 

cacctgggtg gatccgtcaa ttaagagtct gactcttgac tgcagactca ggtcatgaga 1200 

tcaagcccca tgtcaggctc ctcactcagt ggggagtctg cttgggattc tttctctcct 1260 

tctccctcta tcccttcttc tgctctctgt ctcccaatct cataaataaa taaatataat 1320 

ttt 1323 

<210> 54 

<211> 137 

<212> PRT 

<213> dog 

<400> 54 

Met Asn Ser Gly Arg Ala Gly Ala Asn Gly Ser Ser Ala Ser Val Arg 
1 5 10 15 

Asn Lys Ala Gly Ser Ser Cys His Thr Trp Met Lys Arg Gly Val Thr 
20 25 30 

Ala Val Cys Cys Gly Val Cys Ser Val Asn Ser Lys Asp Ser Ser Lys 
35 40 45 

Ala Gly Tyr Trp Gly Ala Val Ala Ser Gly Met Ser Lys Lys His Ala 
50 55 60 

Thr Tyr Ala Trp Gly Ser Gly Ala Asn Thr Val Ala Ala Gly Gly Tyr 
65 70 75 80 

Gly Ala Asp Arg Ser Ser Ala Tyr His Asn Cys Ala Arg Asp Asp Cys 
85 90 95 

Val Ala Cys Ser Thr Val Ala Met Thr Gly Cys Ser Ala Val Ser Thr 
100 105 110 

Tyr Gly Val Gly Val Gly Asn Lys Val Asp Arg Tyr Asn Tyr Ser Tyr 
115 120 125 

Ser Asp Lys Ala Ser Ser Val Asp Ser 
130 135 
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<210> 55 

<211> 1222 

<212> DNA 

<213> pig 

<220> 

<221> CDNA 

<222> (1) . . (1222) 

<220> 

<221> CDS 

<222> (96) . . (827) 



<400> 55 

gtaatatcca ggtaagtatc tcagattagc acagagcttg gagaagagag agacagagag 60 

aggaagccaa gaagtgccaa aggaacagag acaaa atg atg tea tec aag cca 113 

Met Met Ser Ser Lys Pro 
1 5 

aca acc tac cct gga gtg tat gga acc aca cct gat ctt tac cca cca 161 
Thr Thr Tyr Pro Gly Val Tyr Gly Thr Thr Pro Asp Leu Tyr Pro Pro 
10 15 20 

agt aac ttt atg gtg cct ggg tct caa caa ccc cca ggg ttc ate aac 209 
Ser Asn Phe Met Val Pro Gly Ser Gin Gin Pro Pro Gly Phe lie Asn 
25 30 35 

cca cga ate caa gtt cag agt age cag gca cct ttc ate gtc tct cca 257 
Pro Arg lie Gin Val Gin Ser Ser Gin Ala Pro Phe lie Val Ser Pro 
40 45 50 

gga ate ccc aat aac agt cag cag gtt caa gga aat ata caa atg gta 305 
Gly lie Pro Asn Asn Ser Gin Gin Val Gin Gly Asn lie Gin Met Val 
55 60 65 70 

aat cca ggt aca gga aaa gca gee aca aac ttt aaa gaa gaa gca aag 353 
Asn Pro Gly Thr Gly Lys Ala Ala Thr Asn Phe Lys Glu Glu Ala Lys 
75 80 85 

aca eta ggg gca ate cag ate ctg att gga tea atg cac att ggc ttc 401 
Thr Leu Gly Ala lie Gin lie Leu lie Gly Ser Met His lie Gly Phe 
90 95 100 

gga att att ctg ggc tta atg ggc cgc act tat atg caa gtt ttg ggt 449 
Gly He He Leu Gly Leu Met Gly Arg Thr Tyr Met Gin Val Leu Gly 
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105 110 115 

ttt get tec ttg get ttt gtt age gga tat cca ttc tgg ggc ggc etc 497 
Phe Ala Ser Leu Ala Phe Val Ser Gly Tyr Pro Phe Trp Gly Gly Leu 
120 125 130 

tct ttc att att act gga att etc tgt att ttg gca tec aag aag tct 545 
Ser Phe lie lie Thr Gly lie Leu Cys lie Leu Ala Ser Lys Lys Ser 
135 140 145 150 



tec cct get ctg ata aaa age age ctg gga atg agt att gtt agt tec 593 
Ser Pro Ala Leu lie Lys Ser Ser Leu Gly Met Ser lie Val Ser Ser 
155 160 165 

ttc ttt gee ttc att ggg atg ate ctg ttg ctg gtg gat gaa age ate 641 
Phe Phe Ala Phe lie Gly Met lie Leu Leu Leu Val Asp Glu Ser He 
170 175 180 

aac ggg ctg ccc gag caa gac tac tgg gcg gtg ctt tct gga aaa ggc 68 9 

Asn Gly Leu Pro Glu Gin Asp Tyr Trp Ala Val Leu Ser Gly Lys Gly 
185 190 195 

att tea gec atg ctg att ate ttc tct etc ttg gag ttc tgc ata act 737 
He Ser Ala Met Leu He He Phe Ser Leu Leu Glu Phe Cys He Thr 
200 205 210 

tgt gtc aca gec tat ttt gec age cac ace ate ace aac ace agg ggt 785 
Cys Val Thr Ala Tyr Phe Ala Ser His Thr He Thr Asn Thr Arg Gly 
215 220 225 230 

ctg tec tgg tea ttc cac ctg tgt atg caa aca gtc cct tga 827 
Leu Ser Trp Ser Phe His Leu Cys Met Gin Thr Val Pro 
235 240 

cgacaggatt gtcctcagtt cctcccccag ttccggccac ttagctcaca ccacctaata 887 

ccagtaggta ttaaaagaag aagaagcatc ctgeaattte atgggggtag aaccgctctg 947 

cagaaacttc ttaagaagtt ggtcttcgcc gttcacgatg attgetaate tttaaagact 1007 

gtgtttcagg ttctgttctt ggtttgtcat gaatgaaaac tctatggacc tctctaccag 1067 

tgtttcttcc tacatgccac taccacgtgc tggcaaaggt gaagggctgg aggactggaa 1127 

atgactctgc aactctctta aagttagaaa tttcctgtta aagttaccct tccctcaata 1187 

aaatgttact aggagcaaaa aaaaaaaaaa aaaaa 1222 

<210> 56 

<211> 243 

<212> PRT 

<213> pig 



<400> 56 
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Met Met Ser Ser Lys Pro Thr Thr Tyr Pro Gly Val Tyr Gly Thr Thr 
15 10 15 



Pro Asp Leu Tyr Pro Pro Ser Asn Phe Met Val Pro Gly Ser Gin Gin 
20 25 30 



Pro Pro Gly Phe lie Asn Pro Arg lie Gin Val Gin Ser Ser Gin Ala 
35 40 45 



Pro Phe lie Val Ser Pro Gly lie Pro Asn Asn Ser Gin Gin Val Gin 
50 55 60 



Gly Asn lie Gin Met Val Asn Pro Gly Thr Gly Lys Ala Ala Thr Asn 
65 70 75 80 



Phe Lys Glu Glu Ala Lys Thr Leu Gly Ala lie Gin lie Leu lie Gly 
85 90 95 



Ser Met His He Gly Phe Gly He He Leu Gly Leu Met Gly Arg Thr 
100 105 110 



Tyr Met Gin Val Leu Gly Phe Ala Ser Leu Ala Phe Val Ser Gly Tyr 
115 120 125 



Pro Phe Trp Gly Gly Leu Ser Phe He He Thr Gly lie Leu Cys He 
130 135 140 



Leu Ala Ser Lys Lys Ser Ser Pro Ala Leu He Lys Ser Ser Leu Gly 
145 150 155 160 



Met Ser He Val Ser Ser Phe Phe Ala Phe He Gly Met He Leu Leu 
165 170 175 



Leu Val Asp Glu Ser He Asn Gly Leu Pro Glu Gin Asp Tyr Trp Ala 
180 185 190 



Val Leu Ser Gly Lys Gly He Ser Ala Met Leu He He Phe Ser Leu 
195 200 205 



Leu Glu Phe Cys He Thr Cys Val Thr Ala Tyr Phe Ala Ser His Thr 
210 215 220 



He Thr Asn Thr Arg Gly Leu Ser Trp Ser Phe His Leu Cys Met Gin 
225 230 235 240 
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<210> 57 

<211> 6 

<212> PRT 

<213> Homo sapiens, Mus Musculus 



<220> 

<221> conserved_motif 

<222> (1) . . (6) 

<223> X is any amino acid 

<400> 57 



Leu Gly Ala Xaa Gin lie 
1 5 



<210> 58 

<211> 4 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 

<400> 58 

Leu Ser Leu Gly 
1 

<210> 59 

<211> 6 

<212> PRT 

<213> Homo sapiens, Mus musculus 



<220> 

<221> mis cofeature 

<223> X is any amino acid 



-83- 



WO 02/062946 



PCT/US01/48437 



<400> 59 

Ser Gly Xaa Leu Ser lie 
1 5 



<210> 60 

<211> 6 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 

<400> 60 

Gly Tyr Pro Phe Trp Gly 
1 5 

<210> 61 

<211> 8 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 

<400> 61 

Phe lie lie Ser Gly Ser Leu Ser 
1 5 

<210> 62 

<211> 15 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 
<220> 

<221> conservedjnotif 

<222> (1) . . (15) 

<223> X is any amino acid 
<400> 62 

Ser Leu Xaa Xaa Asn Xaa Xaa Ser Xaa Xaa Xaa Ala Xaa Xaa Gly 
15 10 15 
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<210> 63 

<211> 21 

<212> DNA 

<213> Mus musculus 

<400> 63 

cacgaggcac acaagcaaag c 21 

<210> 64 

<211> 25 

<212> DNA 

<213> Mus musculus 



<210> 65 

<211> 24 

<212> DNA 

<213 > Mus musculus 

<400> 65 

tgggtgagaa cacacaatca aaac 24 

<210> 66 

<211> 24 

<212> DNA 

<213> Mus musculus 



<400> 64 

aagtgcttga cttacatact tacag 



25 



<400> 66 

cacatacaca caagagaatt agac 



24 



<210> 



67 



<211> 



24 
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<212> 



DNA 



<213> Mus musculus 



<400> 67 

ccattgtctg tactgtttct gctg 



24 



<210> 68 

<211> 23 

<212> DNA 

<213> Mus musculus 

<400> 68 

gccaaatgca tacacatgtg cac 23 

<210> 69 

<211> 24 

<212> DNA 

<213> Mus musculus 



<210> 70 

<211> 22 

<212> DNA 

<213> Mus musculus 

<400> 70 

caatcgttcc tcatatgcac ag 22 

<210> 71 

<211> 25 

<212> DNA 

<213> Mus musculus 



<400> 69 

ctggaagtga ctgggtgaca aggc 



24 
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<400> 71 

agactctggt ggt cat tact gtctc 



<210> 72 

<211> 22 

<212> DNA 

<213> Mus musculus 

<400> 72 

gaatgccaaa tgcacagaaa gg 

<210> 73 

<211> 12952 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . . (12952) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (4390) . . (4548) 



<220> 

<2 21> coding_region 
<222> (5017) . . (5136) 



<220> 

<221> coding_region 
<222> (6303) . . (6359) 
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<220> 

<221> coding_region 

<222> (7813) . . (8049) 

<220> 

<221> coding_region 

<222> (8851) . . (8952) 

<220> 

<221> coding_region 

<222> (10142) . . (10357) 



<400> 73 

tatctcatat tctcaaaatg atactttgaa atggggaagg taactattac cttaaactta 60 

cagagagacc acaagaggag caagcaagag agctgggact aactctgtct gctacatatt 12 0 

gccagaaatt gggagactga aggtgcaaag aagaaataat ttcttggaag ggctaacagc 18 0 

ttggttggac tgagatgggc tgaaaagcac agagggaagg gaagtaagag cactcttcct 24 0 

cggagatagg tgcacaggag gccagaaggg ataaaaatgc aaaactattt tgagtggaag 300 

agaatgaaga tgaggtaaca acagtgaggc agtttttaaa cttctcagta catgatggta 360 

cgagtggact taaaagtcta ttcaaaccaa aacccataaa ggctgtgaga tcttattcag 42 0 

tggatggaca tgaaaacagg ggtgccttta aaaataaaaa aatatttgca ggagttagtg 480 

tgagtaatta aatgcggtac tgagtgatca cctgagtctc tattttaaca ttcatgctga 540 

tttggacacc agcaaatgta aaatcttatt gttatcatat tgcggctaat gtggtccata 600 

acccctacta atctaccaac tagtctttga gcataactgt gcattgggca ctaagtcatt 660 

gtgaacaatt tattttaatg ataaggcaaa attgcgagaa tctccaatct ccgtaggcga 720 

gttttatgta tttatgaatt ctcatggaac gaaagctttt cattacagtc agcccacaat 780 

gccataaatc gattaatata tttacctctc atatatgaga cagatatata tataatcaga 840 

tatatatatg aaagatagtt tccataagaa ataaaaaaac agagaaagca ctcatgtgtt 900 

tcttttataa actactattt tatacaaata tttgtttaag gtttgtagaa gtatagttgc 960 

caccatgaga taccatttta acagagaaat atcacacaca cacacacaca gccctcatgg 1020 
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tcattattgc 


cattagttac 


aacaggcata 


gcccaagtag 


aattggctta 


caaaaaagtt 


1080 




tatctcgcca 


taatacagac 


acagctttaa 


tcaagcagtt 


gatgaataaa 


atgaaattaa 


1140 


5 


aacaatggca 


gtccaatcat 


ctccttcagt 


tcaaacaatt 


gatgaccttt 


gcattttaag 


1200 




ttatcttcac 


taaagttacc 


cctctggggc 


ggagtgtagt 


gactcccagc 


tcattccata 


1260 


10 


cttgccatag 


gaccttggga 


aaaataacct 


ctctcaacct 


ccatttcttc 


atctgcagaa 


1320 


agtagtcaaa 


atgcctactt 


acgaaggtgt 


tgagtgagtt 


ggcaatacca 


tatacaatat 


1380 




ctctagtata 


tagtaggtgc 


tcagtaaaca 


actataatta 


tctggactca 


cctatggtac 


1440 


15 


agcagaaaaa 


gcatcaacat 


cagagccatg 


gtgagtgcct 


tctagcttgg 


ctctgttgct 


1500 




aactgtgagg 


tctttactaa 


ggcaaattct 


atacccagac 


tgcagcttcc 


tcatttgtga 


1560 


20 


aatgaacaga 


atggtttctg 


aggtttggac 


tagctctaga 


aatcctacct 


agtgaccaga 


1620 


atcagtccgc 


aaactcttta 


gtgggctata 


ttaaaatatc 


tgtttttgtt 


agaggtcttg 


1680 




tagagtggct 


tgttgtacaa 


agcagatccc 


caggaccatg 


tggcatctgg 


ctatctaggg 


1740 


25 


caggggcaaa 


atcaagtgag 


aaacaaccaa 


agtttcttct 


ctctttcctc 


tagcccatgc 


1800 




gccagacaga 


taatactttt 


caactgattt 


atagtcattc 


acatagtgca 


tatattggaa 


1860 


30 


ggagaaaaga 


ggggtcataa 


gggaaaggct 


gaattcctat 


ttctcacagg 


agacaaagca 


1920 


gcctggtacc 


ctggatgcct 


aatttgagag 


atccaagtct 


tcattttaaa 


tctttacagg 


1980 




ctgggcatgg 


tggctcacag 


ctgttatcct 


agcactttgg 


gaggtggagg 


ggggcagatt 


2040 


35 


gcttgagtcc 


aggagttcaa 


gaccagcctg 


ggcaacatgg 


caagacccca 


tctctaaaaa 


2100 




aaatataaaa 


attagctgag 


tatggtggtg 


gttgcccgtg 


gtcccagcta 


ctccagtggc 


2160 


40 


tgaggcagaa 


ggatcacctg 


agcccaggag 


gtagaggctg 


cagtgagctg 


tgatcatgac 


2220 


actgcactgc 


agcctgggcg 


acagagcgag 


gccctgtcat 


gaaaacaaac 


aaacaaacaa 


2280 




aaaatattta 


ctaattatgt 


aacttggcac 


aagtaaccta 


gcttctccgt 


gcccttgttt 


2340 


45 


cctcttctat 


aaaacacagg 


tgatgattca 


ccccttgcct 


gttctataag 


gctcagatga 


2400 




gatgccatgt 


gtcaaattta 


cttccagccc 


taaagtaggt 


ataaatgtga 


cagtagtcag 


2460 


50 


ctgcctgttt 


tacagcacac 


ctgcagctac 


tcagctctga 


ttgacttcag 


taagatatac 


2520 


ttcaggctgt 


ggtctaatgg 


ctgggatgtg 


gcaccggctt 


cagtaagata 


tgtgacctgg 


2580 




atagacagtc 


ttgatttcca 


agacatctta 


catgacttac 


aagcaataaa 


gacaagagca 


2640 


55 


aagatgtgtc 


cccaaaagag 


ggattttcag 


attccaggaa 


agacccagtg 


gagtccgcat 


2700 




aaafcaattaa 


catgaacaag 


tgaatgctaa 


ggaaacccaa 


gatgaagatt 


gaacagaatc 


2760 


60 


ttaaaatgtc 


tgagctggaa 


agggccgtat 


taattgcctt 


ctcacttcag 


aaatgaataa 


2820 


atcatgaaaa 


ctgataaata 


gcatttacta 


cactctccct 


ttgtcctagg 


cacaattaca 


2880 
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tatgttaact 


tatttatgtc 


ttacatctca 


gagaggggta 


ctgttctaac 


tttacagaag 


2940 


5 


gataaaatcg 


aaactaatgc 


tcagcaaagt 


acaaagaaca 


agaatagcaa 


caaaaataac 


3000 


tatttattcc 


aacatgggtt 


ctttgcatac 


atttatttct 


tcaataatat 


ttattaagaa 


3060 




gtaactaaat 


ccaaaaatta 


ttttagatcc 


tgaacaagag 


agaacaaaat 


ctctactttg 


3120 


10 


atggaacttc 


cattctgtgg 


ggaagagact 


gacaataagc 


aattaaataa 


ataaggtaat 


3180 




ttcctacagt 


gatcaatgcc 


gtaaagcaat 


taagatagga 


ttttgtaaaa 


gacagcaaat 


3240 


15 


aggagtacat 


gttatagatt 


gagggttcaa 


ggtaggctcc 


tctaggagct 


gacatttgag 


3300 


ctacacctga 


acaaaaagac 


actagccatg 


cacagaccat 


gagcccagtt 


aagtgttata 


3360 




gcagcccacg 


agataagaat 


tattattatt 


tcaattttac 


agttgaacct 


gaggcccaga 


3420 


20 


gaatttaaag 


aacttgccca 


acatctcaga 


acaaatggag 


gaatcactat 


tgaaacctag 


3480 




gcaatctgac 


tcaggaggcc 


acagtcttat 


atactgcatt 


agaaagcctt 


agagagcctt 


3540 


25 


ttctttttct 


ttgagaccga 


gtctcactct 


gttacccagg 


ctggaatgca 


gtggcatgat 


3600 


ctcagctcac 


tgcaacctct 


gcctcctggg 


ttcaagcaat 


tctcctgcct 


tagcctcccg 


3660 




agtagctggg 


attacaggtg 


cacaccaaca 


tgcctggcta 


atttttgtat 


ttgtagtaga 


3720 


30 


gatggggttt 


tgccacgtta 


gccagcccgg 


tctcaaactc 


ctgacctcag 


gtgatctgcc 


3780 




catcttggcc 


tcccaaagtg 


cttggattac 


aggcatgagc 


caccgtgccc 


gacctagaga 


3840 


35 


gccttcttga 


tgtgacttgc 


acaaggtggc 


agagttagag 


acagagagag 


gcctggaatc 


3900 


gacccctcct 


gcttctacag 


atagtcctta 


ccatactctg 


caatgttgcc 


tctggcccat 


3960 




cataatgcac 


aaaggcagat 


aagcaaaagg 


acaaggacaa 


gtccattgaa 


aatacatttt 


4020 


40 


tcaatattaa 


agcaaaagaa 


aagcatccag 


gaataagaaa 


caaagaggac 


atgcagtcat 


4080 




atatgcaagg 


tgtcctctac 


aaagataaag 


aatgccccaa 


acccagttgt 


caagatcact 


4140 




ggcagggact 


cctgggccca 


catgctcttc 


ctaaacaacc 


cctccatctc 


ctttctcaga 


4200 


45 
















actcagcagt 


aggccttgcc 


tcagatccaa 


ggtcactcgg 


aagaggccat 


gtctaccctc 


4260 




aatgacactc 


atggaggaaa 


tgctgagaga 


agcattcaga 


tgcatgacac 


aaggtaagac 


4320 


50 


tgccaaaaat 


cttgttcttg 


ctctcctcat 


tttgttattt 


gttttatttt 


taggagtttt 


4380 




gagagcaaaa 


tgacaacacc 


cagaaattca 


gtaaatggga 


ctttcccggc 


agagccaatg 


4440 


55 


aaaggcccta 


ttgctatgca 


atctggtcca 


aaaccactct 


tcaggaggat 


gtcttcactg 


4500 


gtgggcccca 


cgcaaagctt 


cttcatgagg 


gaatctaaga 


ctttgggggt 


aagtcagttg 


4560 




ccttccatcc 


catgtcgtag 


ggattctctg 


gctgacagaa 


gctgatgcgg 


tataggccac 


4620 


60 


atacagaatt 


caatccaatt 


tgaagaattg 


ggatccaacc 


tgatgtcttc 


tttatgtcta 


4680 
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acacagtggg ccaaatcagg ggtgcatcag 
atcttatgtc accttttggt tttggggctt 
5 ttgccagaat cccaggcata cctgctccct 

ttcgaacaag aaaaagacaa aattcttggc 
cagctccaag tcaggagcca gagcttccaa 

10 

tcagctcatc cactgctgcc tctgttctct 
ttccacattg ccctgggggg tcttctgatg 
15 gtgactgtgt ggtaccctct ctggggaggc 

tgggaaatgg tgcagacaaa aatgttaaaa 
gtgttgaggg aaatatatga gtaggaaata 

20 

ttgaccaaat tgagtataaa tcccatggtt 
cagagaaagg ggccctgcat ggatctcaca 
25 tcagttctgg tggggagggt ggccctagac 
agcaagaggc agcaggaaca tccacttcca 
gttactatgg tacctggtga aacctgtccc 

30 

cagagcagca ctcttttggc tgttttacgt 
ggtgtggcac ataggacagt gatgtttatt 
35 taaaagataa gtccaggatt aaaattttcc 

ttcctccatt ctacagcctg aatgtgagaa 
cctgtaacag ccaatgtttt catggagtgc 

40 

gcatcagttc actcttcaga gggcatcaga 
cagttccaca agttaaaaac aaagaaaaac 
45 agtattaatt gatgaggaaa tttgagttct 

cagatcccca gatgattgag taaaaggcat 
aggaatatga cagctgaaaa atgaatacaa 

50 

agttgctgaa agtgtaggct tctttctgac 
ctctcctcta tctcctgtct tgcccacccc 
55 agtatattat ttccggatca ctcttggcag 

caagtaacca tatgtccttc tttcccacat 
aaactgagac ccttaaaaag ccaaggtatc 

60 

ctccacaatg ttattgtgat tttatttatg 
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agaagttatc 


acttagatca 


cctctgggtg 


4740 


gtatatgcag 


gggtccccca 


tcccagtcca 


4800 


ggaaatgccc 


catgtggttg 


aggaaacaga 


4860 


acctccactg 


cttcctttag 


gcattcctca 


4920 


ccttgtcttt 


gcctgctagc 


agtgatgatt 


4980 


ccccaggctg 


tccagattat 


gaatgggctc 


5040 


atcccagcag 


ggatctatgc 


acccatctgt 


5100 


attatggtga 


gtaaaagaat 


agcagccatt 


5160 


ggctccacag 


ggatatgcca 


gattatttct 


5220 


ttattgggtt 


aaagtaatta 


agaagacagg 


5280 


gagagtcagt 


ggtcctgttt 


catgtgaatt 


5340 


gggactgtcc 


aaagcaagaa 


ctctccaaag 


5400 


atttagacta 


gatagcaaga 


tgttttggaa 


5460 


tctacccctt 


cttgcttaca 


attctgtttg 


5520 


atcacaagtc 


agtctcattt 


tgcttatcga 


5580 


acatgttttc 


caaatctgta 


accctgtctg 


5640 


tccccgtgat 


acttttcata 


gttgccacta 


5700 


aaacaagtag 


agatgggtta 


gaacaagagt 


5760 


aaggaagatg 


agcaatagtc 


atcatcactt 


5820 


ctgtgccatt 


caggtcaagt 


atttccttct 


5880 


gtcatttatg 


tcactgtgaa 


ccccaaaggg 


5940 


tagaaataaa 


acttttaaat 


ttatggtatg 


6000 


gtctctttgg 


tcttactata 


ttcctagtca 


6060 


gaatttagtg 


tcactgagcc 


tgaataaagg 


6120 


ctgataaaaa 


tgggtggatg 


gttgtgtgaa 


6180 


cagttatcaa 


tgttaaaaag 


tgatctccct 


6240 


ctctccatct 


cccccacctc 


tcttttttac 


6300 


caacggagaa 


aaactctagg 


aagtgtttgg 


6360 


qtcagagaag 

Zj Zj Zj Zj 


tacctatttt 


tttcggttaa 


6420 


acagcctctc 


agccctaaaa 


agcaaagacc 


6480 


aaaaacttag 


aagcgaggtc 


tatctgaagt 


6540 



-91- 



WO 02/062946 PCT/USO 1/4843 7 



atgttcatgg 


gaacagaact 


aaaagcagat 


ccatgaaaac 


catacctaca 


gtcttaagaa 


6600 


cgttaaatgc 


tgtgtgaaaa 


taatagacct 


ttctgaaagc 


cctatcattt 


ctcccagatc 


6660 


accatttagg 


aaaattatct 


gatcaatgtc 


atgattgatt 


caaattctag 


ctaagccatt 


6720 


ttttggccgt 


aacattgaac 


aagtcagttt 


acctctatgt 


tcctgagttt 


tcacctagaa 


6780 


aggaagggta 


acagtccttg 


ctaccatgtg 


acgtccaatg 


gagatgaaag 


gcagtagagt 


6840 


gtgtgatggt 


gcttcacagg 


ctataaagta 


ctacactgtg 


gtcttgccca 


taaaacccct 


6900 


agggactcat 


ctagatccag 


aggaaactgg 


cctgcagagc 


tgctgatgct 


gtatggatga 


6960 


aaagagttta 


gcagcaagtt 


cgctcccaaa 


aaattcctcc 


cccaacactg 


ttactaaact 


7020 


gtgtcacttg 


cataatcaat 


gagggaatgg 


gtggattgan 


nnnnnnnnnn 


nnnnnnnnnn 


7080 


nnnnnnnnim 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnttatattt 


tcctgtttca 


7140 


accttttatc 


atccctgcaa 


gggtagaaca 


ttcacactga 


tattttctta 


cctatgctac 


7200 


ccaaagacat 


cagccctaat 


tgtattttgg 


aagatagctg 


actggggctg 


attgcaacct 


7260 


atgtcagcag 


gaatagatgt 


tgttactgtt 


gttgcttctg 


ctttttttat 


ttccatttat 


7320 


ttgatagtac 


agatctagag 


ggttctatct 


gaaccttccc 


aacctatact 


tcataatacc 


7380 


atcccactaa 


agtgtgatac 


aagaaacttc 


ttcactctct 


tccctctacc 


tatttatgaa 


7440 


ggcagataat 


aaactggata 


atatttatct 


tcacttattc 


aacaaacatt 


tattgagtgc 


7500 


ctactaggat 


ggtggcagtg 


gcagtgaagg 


aaatgcaagg 


atacaagata 


tagaatcaag 


7560 


ggttactctt 


agaatttttg 


ctttataaaa 


cagatggatg 


gtgaatgaga 


tagggaagac 


7620 


tgagaaaaga 


acaggataga 


gacatgattt 


tattttatag 


tgacaaagag 


gctaaaaaca 


7680 


actgagagaa 


cttcagtata 


tttagttgta 


gttgctttgt 


gagtcagggc 


agttgcattt 


7740 


ggaattccct 


cccagattat 


gttttccaaa 


gggaaatcaa 


acccaattaa 


taaatctgtg 


7800 


tctccatttc 


aggtcaaagg 


aaaaatgata 


atgaattcat 


tgagcctctt 


tgctgccatt 


7860 


tctggaatga 


ttctttcaat 


catggacata 


cttaatatta 


aaatttccca 


ttttttaaaa 


7920 


atggagagtc 


tgaattttat 


tagagctcac 


acaccatata 


ttaacatata 


caactgtgaa 


7980 


ccagctaatc 


cctctgagaa 


aaactcccca 


tctacccaat 


actgttacag 


catacaatct 


8040 


ctgttcttgg 


taagtgttct 


tggtaagtgt 


gagattggat 


ttctctccag 


ggaggaagga 


8100 


tgacttgttt 


attatgagca 


tgaactctgg 


atccagacca 


cctgagtttg 


ctagttactg 


8160 


tctgtgtggc 


tttgggaaag 


aattttaacc 


acactgtgcc 


tcaatttctt 


caactgtaaa 


8220 


atggggatgg 


caacacca cc 




y y t l. g t. c c l. y 
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atacaaagca 


tttagaacag 


tgtttggtac 


acagaagtgc 


tatataagca 


ttggctttaa 


8340 
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cattaattat tctcaacatc attaatggca 
agcatagaag aaattcctaa gatgttatga 
5 gagtttgaaa gagataaaaa aaaatgcatc 

atgccacacc aactaaggct gaatacaatt 
atacagaatt gattcttcaa ctgattattc 

10 

aaccatctgt tgtgtgccaa aagttgtgtt 
tgtactagca gttctcacag ctattcatta 
15 caattatagt caacaataac ttactgaaca 

tttcaccttc attcttctgt tgtttttcag 
ttcttccagg aacttgtaat agctggcatc 

20 

agacccaaat ctgtaagtag tagcccctct 
ccacaaagga tcatttatgg agaatgtaat 
25 ttaaaaaaaa aaattggtct tggcatattt 
tttgaaataa gtctgtagtt agagcctaca 
tttttttgag acagagtctc actttgttgc 

30 

cccagtgcaa cctctgcctc cgggttcagg 
tgggattaca ggcacccgcc atcatgcctg 
35 gttcaccatg ttggccaggc cagtcttgaa 

gcctccgaaa gtgctgggat tgcaggcgtg 
ttattaacct tccctgctgc ttgacttcag 

40 

cagtgtgtaa aagcaagatg gttgaaggtt 
tttttcagta ttcccctctt ccctcctttt 



50 



60 



aaagggctcc 


acaaatggga 


aataaaaaca 


cagaaccaga 


ctgttagatt 


tagaatttag 


taaagaaaaa 


tagtatcaaa 


gagagaaaat 


aaatcagaac 


gaaaatctaa 


actcctctat 


ggtcattcca 


ggcactgtgg 


tcaatgtctg 


tttaataaat 


gacttgataa 


ggatataagc 


atataataaa 


tgacataagt 


agcataaaaa 


attgttgaca 


aaggtgtcag 


tggtaaaagt 


gaacatagtt 


ctcctgtcag 


cagaagaaaa 
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ttaatattta 


ccattatatt 


aactaagatg 


8400 


atatcttttg 


cccaaggact 


tttaaatgta 


8460 


tcccaaggat 


gaagaggtct 


cctaggcatg 


8520 


taatttgaga 


ctcattactt 


attctttcct 


8580 


ataccttact 


ttctatcagc 


aatacacatt 


8640 


aagagttagg 


gttataaaga 


tgctgtctcc 


8700 


cttgtctaaa 


gaattgatct 


cttaatcgtt 


8760 


ccaacaatgt 


tctttgtgcc 


attattacat 


8820 


ggcattttgt 


cagtgatgct 


gatctttgcc 


8880 


gttgagaatg 


aatggaaaag 


aacgtgctcc 


8940 


ggccaaaacc 


tccctctaga 


aaatccacat 


9000 


ctgatgagtt 


gttgaaacta 


ggagcttgat 


9060 


ctagaaagca 


aataatatga 


tgtcttgaaa 


9120 


gttctatatt 


ctgtgcgtct 


tttttttttt 


9180 


ccaggctaga 


gtgcaatggc 


gcgatcttgg 


9240 


cgattctcct 


gcctcaccct 


cccgagtagc 


9300 


gccaattttt 


gtatttttgt 


agagacgggg 


9360 


ctcccgacct 


caggtgatcc 


tcccacctca 


9420 


agccaccatg 


ccccacctct 


gtgcatcttc 


9480 


gcttccatcc 


tgatttcatg 


attccatata 


9540 


gaaggctaag 


tcactagttc 


tttggtttca 


9600 


cacctaattc 


ctcttccaaa 


acaacaggtt 


9660 


gaccaataat 


agatatatta 


ctaactgctt 


9720 


aaatcatcgc 


taattacttc 


attttgcatt 


9780 


tattctccca 


gagttatatt 


cactaaaggc 


9840 


tactgcaatg 


ttcttttccc 


aataccacgt 


9900 


ctgcccttga 


agatttattc 


agacttgagt 


9960 


acctgcaaaa 


aaattttggc 


atttaaaggc 


10020 


ccaggaqqta 


tttgataaat 


qtttgtggaq 

ZJ ZJ ZJ ZJ ZJ 


10080 


aaagaatggt 


ttgtttaatt 


ttctgtttta 


10140 


aaaagaacag 


actattgaaa 


taaaagaaga 


10200 
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agtggttggg 


ctaactgaaa 


catcttccca 


5 


tccaatccaa 


gaagaggaag 


aagaagaaac 


tcaggaatcc 


tcaccaatag 


aaaatgacag 




tttccttttt 


taaacattag 


tgttcatagc 


10 


<t a rrrrh a t~ f~* i~ 

gay gtao, utL 








tccttctctc 


ttacattgaa 


tgtagagaat 


15 


Cttcttcttt 


tgagcaactt 


tcttacactg 




gatttcctac 


taacctgttc 


cttggatagg 




ttctccatca 


acaaccaggg 


agactgcacc 


90 




r> t- a 1- r* t- 1- 1- t- t- 






tcattgtttt 


aaggatgata 


aaaaaaaata 


25 


tttatctttc 


tacagggctg 


acattgtggc 




gaagctctaa 


atagecaaca 


cccatctgtt 




acatgtctct 


gecttaaett 


ttcctaactc 




4— -3 /-i 4— 4— 4— 4— •-a 

uacuc ccay c 




y cLdcicty ctctcici. 




gcccaaattt 


caaataataa 


ttaacattta 


35 


atttcatcaa 


ataacttcat 


ttgatgttcc 




tacaagtaat 


tcaaagaggc 


caaataactt 




cataactagt 


aattatgaga 


gccttgtttc 


40 


atgucuyaaa 
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cagcttttat 


caagaattta 


ctctctgacc 


45 


agccagattc 


agattttacc 


catggggata 




aactacttgc 


atcataaata 


tataataact 




tatagacact 


ctcagaatat 


catacttgga 


50 


O delect y L Ly L. 




aaffpanaha 
ct ct u tuay a uct 




cctagggtga 


agtcactata 


atctgtagtc 


55 


getgecagat 


tgtggcaggt 


aaagagacaa 




atttttagcg 


accactagtg 


gaagacattc 




tttctgtcta 


tgtaaagttc 


tcaaaatttg 


60 


ttttctgcaa 


cagatgattc 


caacatgggt 
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accaaagaat 


gaagaagaca 


ttgaaattat 


10260 


agagacgaac 


tttccagaac 


ctccccaaga 


10320 


ctctccttaa 


gtgatttctt 


ctgttttctg 


10380 


ttccaagaga 


catgetgact 


ttcatttctt 


10440 


tatctggect 


ttgcatggag 


tgaccatagc 


10500 


gtagccattg 


tagcagcttg 


tgttgtcacg 


10560 


aagaaaggca 


gaatgagtgc 


ttcagaatgt 


10620 


ctttttagta 


tagtattttt 


ttttgtcatt 


10680 


tgatggaaaa 


gatatatgac 


tgcttcatga 


10740 


atctacgttt 


ttggtggagt 


cccttttgca 


10800 


acaactaggg 


acaatacaga 


acccattcca 


10860 


acattcttag 


agttaccaca 


ccccatgagg 


10920 


ttttgtaaaa 


acagcatagc 


ttatacatgg 


10980 


ccactctagg 


ctattgtttg 


catgtctacc 


11040 


aaatgaccat 


agaaaatgee 


accatgaggt 


11100 


gttatattta 


taatttccag 


atgacaaagt 


11160 


atgatcaaga 


aagaatccct 


atctctattt 


11220 


gtaaacaaga 


aaaggtaact 


tgtcaacagt 


11280 


ataaccaggt 


cttcttactc 


aaatcctgtg 


11340 


tgtctgeata 


aactgtgaga 


gecaagtcaa 


11400 


agcaataaac 


aagcactgag 


agacacagag 


11460 


aaaagactca 


gactttcacc 


acatttggaa 


11520 


ggtagtttat 


atgaagcaga 


cactaagtgc 


11580 


aacaatgtaa 


ttaaaatgee 


gaatctgagt 


11640 


tactagtacc 


ttacctagaa 


ataatgttaa 


11700 


tattatttgg 


geatttgeta 


catgatgagt 


11760 


tgtaatttgc 


actccctatg 


atatttctac 


11820 


cccaaaatta 


gaaaaaaagg 


agatagaaga 


11880 


ttctaaatta 


ataaaactat 


ctttgtgttc 


11940 


gtttgtctat 


tcttctttac 


tcttgaaaca 


12000 
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ttagaccatg 


ggaggctctt 


acagccttga 


gttgatattt 


atacaaccca 


aatctaggtt 


12060 


tgaacggtga 


ggtgtcaggt 


catcaaatat 


tcatgtctat 


atagtcttac 


acaggttctc 


1212 0 


aaaaaaaatg 


ttcatgggat 


aggtcattga 


taatggattc 


cttattctga 


gaactccaga 


12180 


cgactgaaat 


atatgagaga 


aggaaaagga 


catagtagga 


gcaggcctga 


gaaaaaaatg 


12240 


aaagtcagaa 


atctttaaaa 


aaatacaaga 


tcttatttct 


atcttatttt 


ttctcctctt 


12300 


ctgaaatata 


tatgaggatt 


cctctccaaa 


cccatggttt 


ctctaagaat 


tttgagtcat 


12360 


ttgtatgacc 


tcaaataatt 


agttttagct 


gacctcacat 


aactccttat 


aataggagac 


12420 


atctttaatg 


tctgctatta 


aagaaggatg 


aaaattccta 


tgaccttctc 


cccgattatc 


12480 


cctttggcta 


tatagagtca 


aataataaca 


ttgaccaata 


gtaaacatgc 


tttgccaaga 


12540 


agtagaagat 


atattctcta 


gccttagttt 


ttcctcccaa 


tttgcatttt 


tgtaaaaata 


12600 


atgttgtatc 


cacaaaggaa 


ataaacttta 


aaaacccaag 


tgcactcatt 


gactaggagg 


12660 


ctgacgtaac 


ttattttcat 


tttcttctgg 


ttttcttata 


atctttgttt 


aaaataaggt 


12720 


gataaaccat 


cccagtttgt 


ctgagtctga 


gggattactt 


gtttggaaaa 


ttttaatcct 


12780 


ggacttatct 


tttatagtgg 


caactatgaa 


aagtatcacg 


gggaaataaa 


catcactgtc 


12840 


ctatgtgcca 


cacccagaca 


gggttacaga 


tttggaaaac 


atgtacgtaa 


aacagccaaa 


12900 


gagtgctgct 


ctgtcgataa 


gcaaatgaga 


ctgacttgtg 


atgggacagt 


tc 


12952 



<210> 74 

<211> 10100 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . - (10100) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP , or dTTP) 
<220> 

<221> initial_coding_region 

<222> (1918) . . (1973) 
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<220> 

<221> coding_region 
<222> (2844) . . (2973) 

<220> 

< 2 2 1 > coding_region 
<222> (3488) . . (3622) 

<220> 

<221> coding_region 
<222> (5944) . . (6000) 

<220> 

<221> coding_jregion 
<222> (6552) . . (6710) 

<220> 

<2 2 1 > coding_region 
<222> (7116) . . (7214) 

<220> 

<221> coding_region 
<222> (8710) . . (8805) 

<400> 74 

ggaaaccccc cccaaaaaaa aaaaaaaaaa aaaaaagaaa agaaaagaaa aagaaagata 6 0 

aaatgaaaat tcgtggctgg gcgcagtggc tcactcctgt aatctcagca ctttgggagg 12 0 

ccgaggtgag tgagtcacct gaggacagga gttcaagacc agcctggcca acatggtgaa 180 

accccttctc tactaaaaat tcaaaaaaat tagcccagtg ttgctgcaca ctcctgtagt 240 

cccagctact cgggaggctg aggcaggaga attgcttgaa tctgggaggc ggaggttgca 300 

gtgagccatt gccctccagc ctgggtgaca gagtgagact ctgtctcaaa aaaaaaaaga 360 
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10 



aaaaaaaaga 


aaatagaaaa 


ttatcgaggc 


tgttaccttg 


tctgggaatt 


ccttccctga 


ttttaccctt 


ctatgtgttc 


caacatcctc 


catttatctt 


tgttctattc 


tgcttgattg 


catctgtgca 


tgcacagtga 


taagggcagt 


atgaacgtat 


aagtgagaaa 


gttctattcc 


ccaattcctg 


aagacaccat 


qqqaagatga 


ctataccaat 


atgtctttta 


gtatactctg 


agattttctt 


tttcaccccc 


cacccccacc 



ttttagtatc ttaaatatta ttgaaagcca 

20 

taaaagagat taaaaccttc tctgaatttt 
gttatggtat ttcaatgatc acatggttta 
25 atccaatgct gtttagattt gctcatggga 
ctcactatcc cattcttgcc actgtaaaga 
gacaatggag tcttcttagc ttttacacac 

30 

cagcctgttg tatttccaag accatctgct 
acggaaagag agagaaagat gatagatatt 
35 actaacacat actcactcac atacaacagc 
agcatgcctt aaacaaggaa gacaagcttt 
agaaatggct gaattgggac acaagtgggg 

40 

ttttctgcag ttctttctca ccttctcaac 
gtaaatcctg ctagtctcag gcaaaattat 
45 tattagtctt tatttagtag acttcttaat 

gatcttaatc agccaagaaa cttatctgtc 
tcaatgtcat gaggtaaccc atttcaactg 

50 

. caccaagtct caatataata atattcttta 
gacacagaaa gtaataggag agcaaatctt 
55 caaggtatta tttttttcta ccctcagtca 

gcttaggaga tgaaacttta ttgatttagg 
gggtatgagg aagctacttg ctgtactttc 

60 

tttttactca gcttgcaaac tggaaataga 
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tatcatttta 


attttaagtc 


tgctctcaaa 


420 


actctacata 


tcccctatca 


ggcagaattg 


480 


tggcctccaa 


catattgggt 


aataatatat 


540 


tggactagaa 


ggacctggga 


ttttggcatt 


600 


gcctggcatg 


gaacactaag 


ttctttttgg 


660 


ttaattccaa 


tagcaccttg 


tttaggtcca 


720 


cacctgagtt 


aataaactta 


gatacaaaag 


780 


tttatgcaaa 


tatcttagtt 


tcctgcaaaa 


840 


aaaaaaacca 


cacatgatta 


gaggtgtgat 


900 


ttcaaggtta 


tgtttataat 


taaactgaaa 


960 


atctgtactg 


attttgctat 


ggtctagcta 


1020 


gccctaattc 


taatttaggc 


atactactca 


1080 


tttgttataa 


agttcatgtc 


tctcagtgtt 


1140 


tctaagcacc 


atcagtctaa 


taatttagca 


1200 


tatcaggtat 


ggttcacagc 


agttagaatg 


1260 


gcagcgaatc 


atgtaaacag 


tttccaaaag 


1320 


aaagtcagag 


atggatctgg 


agatttagag 


1380 


acttcagttt 


tgggtctatg 


tgtgaaacca 


1440 


tcaaaggtgc 


aattggataa 


cttctgccat 


1500 


acaattccag 


aagaagggca 


catctctttc 


1560 


tcctactaaa 


atgtctcatt 


ttcaggttct 


1620 


gctccaggag 


tctcaaattt 


tcttatttca 


1680 


ttttctattc 


atcacaagta 


aaagcctgtt 


1740 


tggcaaatga 


cttatgtata 


aagagaatca 


1800 


cctattcaga 


gcatgcagta 


agaggaaatc 


1860 


ttcctggaca 


gctcggttaa 


tgaaaaaatg 


1920 


gctctcccac 


aggagccttc 


caggtaggta 


1980 


cttagtggca 


ggggaagtca 


tagtcacggt 


2040 


catggatcca 


tctagtttaa 


ttaatatatt 


2100 


catgtggttc 


tctctccctg 


gagaggaaca 


2160 


ttttctcaca 


ttagaagctc 


attttctggg 


2220 
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tatgagacag gagagttcat actgtgtatg 
ctgagggaca catatccttc acacatgctt 

5 

ttgattggtc tttataatcc ataagctgtg 
agagtcccat ctagctattc aaagccattc 
10 ggtttgttac tgacaaaata tattcagata 

ataggtaaat gtgtttattt tcagttttac 
gaaagtaaca ctggattaag aataggaaaa 

15 

atctaatcac ttggatagtg ttcaggtgtt 
tagtttcttg gtgattacaa tgaagatttg 
20 aaactcccct ttctgtctgt cgagaatgtt 

tgaattgctt ttaaatttca cagtgtgcct 
gaagtatctt caggcagact attgaagtcg 

25 

acagttttga aaaaagagca ggagttcctg 
agagtaaggg ttgggtctag aaaagaatat 
30 ttcatgagag gtgttaggtc ctttaaaaaa 

gtccttttgg gaagagagaa gcatttcctc 
ggccaacagg agtgaaggat tcataatctt 

35 

tgatgttcaa catgactacg tacataaagg 
attaggtagt gcttgtatca attcttatag 
40 tggatcattt ctcaggacag tctaggacac 

attaaaaaat gatacaatct cgggaaaatt 
tctataggta acacaaattc tgactgctat 

45 

ctctgtactt gatatttcac acattgaggg 
tccattctgg ggagccatat ttgtgagtat 
50 aacataggtt tttctctttc tcagatctaa 

atttataatt cttaattata aattatatgt 
caacaacaaa agattctttt tacaattaac 

55 

tcccatgaga aacctgaatc taatacaagt 
atagtaattg aacctaaact ttcagaatcc 
60 atcaataaag ttatatgata aatacataca 
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tagatctctg 


gcttctgggt 


ctgacatgtg 


2280 


ttataaatac 


ttgataaagt 


aacctgcttc 


2340 


ggatgcttct 


ctgaagatga 


aaatagtaat 


2400 


cttcattgta 


ttctgtgcac 


atgaagttgg 


2460 


catttctatg 


ttaaaaggat 


tgtgagatgc 


2520 


ttgtcaacat 


agatgaatga 


gaaagaactt 


2580 


tttggcatgg 


attttgctcc 


attttgtccc 


2640 


cttggtcagt 


tacttggatg 


ctctgagctt 


2700 


aattacagga 


tggctttgaa 


aaaataaaca 


2760 


gcacagggag 


ttacagaatg 


ttctcatgac 


2820 


gcatttgaag 


tcttggaaat 


atctccccag 


2880 


gcctcatccc 


caccactgca 


tacatggctg 


2940 


ggggtgagtg 


agcctcctcc 


aactttgact 


3000 


tgagttgcat 


caactgtttt 


cccacttgga 


3060 


catggtagat 


aaagagttga 


cactaactgg 


3120 


ataaagactt 


taaattgcta 


ggacgagaat 


3180 


tatctttact 


tagatgtaaa 


gaacaattac 


3240 


cgcatggaga 


aaagtattgg 


ccttccatgc 


3300 


tggctagggt 


atcctggaaa 


atcttacgtg 


3360 


taacgcagtt 


tctcatgttt 


ggcttctatt 


3420 


tttttgattt 


tcatgaaatt 


catgtgtttt 


3480 


gatatgcctt 


tgttttggaa 


cagttgtctg 


3540 


agacattttt 


tcatcattta 


aagcaggtta 


3600 


atatctataa 


ttgtttctga 


aataacactg 


3660 


ccagttgttt 


attcccagta 


ttaacatgat 


3720 


gagcatatat 


aacatagata 


tgctcattaa 


3780 


ggtgggttaa 


acatttagcc 


cacagtttta 


3840 


taaatgactt 


gcctaagggc 


cacttgacta 


3900 


aactccagga 


acatacttct 


agcactattc 


3960 


actttatctg 


tcaactaaaa 


ataacaacaa 


4020 
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aggctgggca tggtggctca cacccgtaat 
atcacctgag gtcaggagtt tgagaccagc 
5 taaatataaa aaattagctg agtgtgatag 

gctgaggcag gaggcttgtt tgaacctgga 
ccattgcact ccagcctggg caataagagc 
aatagaaaat aaagttgtct tcatgaaaaa 
ttaagatcaa agggcatatg gtgaccttct 
15 cctggtatct cttttactca tcgttctatc 

aggtatataa ctttttaaag gtctaattga 
acaatttgct aaccttgaca cacatagaca 

20 

gtatgtatcc atcatttcca aaagcttccc 
tccactcctt gtcctctcgt tcccaagaaa 
25 aacttacatt ttttagagct ttatataagt 
tgcacagtta ttttgagatt cttcaagttt 
atatatttta attctagact atgtcacatt 

30 

cagatgactt gttcaggact ctccttgaac 
ctactgtctt tttatttcaa acacagctta 

35 atctgagtct ttaggagatt gctttgaatt 

tgtaaatgtt tggtaaaaat atcatcatgt 
atatatgtca gactctggaa atatgcatgc 
tgatggaaga cgcacataca aatagacgat 
attcatgccc tgatgttgct ggacgttgct 

45 ttagcatctc ctggaagcct catgtaaatg 

gaataagaat ctgcatttta gcaagaccct 
tttttagatg gagtctcact cttgtcgccc 

50 

cactgcaacc tctgcctccc gggttcaagt 
gggactacag gtgcatgcca cagtggctgg 
55 tttcaccatt ttggtcaggc tggtcttgaa 

gcttcccaaa gtgctgggat tacagacatg 
catttaattc tgagaagcac tctatagaaa 

60 

tgacattaat aagtaacttt atcgagtacc 
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cccagcactt tgggaggctg aggcaggtgg 4 080 

ctgaccaaca tggtgaaacc tcatctctac 414 0 

tgcatgcctg taatcccagc tacttaagag 42 0 0 

aggcagaggt tgcagtgagc tgagattgtg 4260 

gaaactctgt ctcaaaataa taataataat 4320 

tgaggaaaga gattgctggg gtgagaaaca 4380 

atgccctaga aactctttta ggtattttct 4440 

tggaaaaata ggtggatgag tgagataata 45 0 0 

catataataa attgcaagta tttcagatgt 4560 

cacatgaaaa catcaccaca ttaatacaat 4620 

tgtgtatctt tgtaactctt tcttcctccc 4680 

acattgatct gctttctgtg aatataaatt 4740 

atgttctctt tactgtttgt cttccttcgc 4800 

tttctttata tcgatacttc attcacaaga 4860 

gactttgtag tctgctaaat ccttagtgct 4920 

ctgtacctct gttataattg aaacttgtct 4980 

tgaggtgtct ctcaacccat caaactcaca 5 04 0 

tgtgctattg acttatattt atatcaaata 5100 

acattttcat aattactcta tattcacatg 5160 

cacagacacg tgtttcttgc ctaaaggggc 5220 

tgcaatagaa tgagagtggt ggtctaatcg 5280 

actccaagag taacccctgc attgtcaggg 5340 

aagaatttca tgctccatcc aggacctaat 5400 

catatgattc atatacactt tttttttttt 5460 

aggctggagt gcaatggcat gatcttggct 5520 

gattctcctg tctcagcctc cctagtagct 5580 

ctaatttttg tatttttagt agagacaggg 5640 

ctcatgacct ccggtgattc ccccgcctcg 5700 

agccaccaca cccagcctta ttcgtataca 5760 

ataagaataa gaaaatattg ggctcacagg 5820 

ccaaatgtta cctatgtttg gaagatgggg 5880 
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ttaaaaagga cacattgaaa acaagaaact cattgtggct tttttttcct cctttttgaa 5940 

cagttttcta tttctggaat gttgtcaatt atatctgaaa ggagaaatgc aacatatctg 6000 

5 

gtgagttgcc cgtttctgtc tttgtccatc cttgaaaaga taagaagaac agagttttaa 6060 

gagtcttaag. ggaaacacat ctttgtctcc tatattactt gtgaatgtgg atatatgatt 6120 

10 ttgtttcaat ctattttgtg tcctaaggct ttttgcaaca gaagttggat ataatcatta 6180 

gaaacataaa ttgtaccatt taacatacaa tgaagtttat gtttaccttg acgtttcttc 6240 

taaaaaaagt gtcctcacac cggcattgtc cttgtaggca tattcacatg atcaaataaa 6300 

15 

taattagttt tcaattaagg agaatatttg aggaaagacc gtacgtgttc atgtggttcc 6360 

tgaaggcagt ccagtgagaa agtaatatat gctcattaaa caatgcggac attttcaggg 642 0 

20 tttccctttt taaccaaaat ttggaagcaa tgtggaattt actggatgca tccagccctg 6480 

aaatgaagat aggtttattg aatgtgccag caagtgcagg cccaggtctg agtgttcttc 654 0 

attattatca ggtgagagga agcctgggag caaacactgc cagcagcata gctgggggaa 6600 

25 

cgggaattac catcctgatc atcaacctga agaagagctt ggcctatatc cacatccaca 6660 

gttgccagaa attttttgag accaagtgct ttatggcttc cttttccact gtatgtattt 6720 

30 ttttttgtgt gggaagacta agattctggg tcctaatgta agtaagaagc cctcttctcc 6780 

tgttccatga acaccatcct tttctgtaac ttctattaca cagtatagtg gttctgtaag 6840 

ttcacacagc ccagggagat gctggctgcc cactcccctc aacccaggca aattcctcgg 6900 

35 

ggttaaagtt atctactgca agtgacgatc tctgggtttt tctgtgcctg tgtttgtgtg 6960 

tgtgtgtgtg tgtgtgtgtg tgtgtgtgta tgtgtcactt taaaaggact ggtcagatgg 702 0 

40 tagggagatg aaaacaggag atgctataag aaaataaact tttggggcga ataccatgtg 7080 

actctttttg tttgtcattt gttgctgttc aataggaaat tgtagtgatg atgctgtttc 7140 

tcaccattct gggacttggt agtgctgtgt cactcacaat ctgtggagct ggggaagaac 7200 

45 

tcaaaggaaa caaggtagat agaagcccga tataaaatct tgaatgacag gttaacgaat 7260 

tggagcttta ttccttaaaa atatggcctg ggttttctga aacatttctt ccagaaaata 7320 

50 gtttctccaa gttttattac tttggtttac aaatctcaca ttttaaatca cattttatac 7380 

ccataagtag cacacatttc ataatatatc cctctgaatg agggttggga taataggatc 7440 

tgtataatgt tagaaaatgc cttaaagtgt gtggagcatg agagatggat gtgcagaagg 7500 

55 

cttgtgagga aaccacccag gtatctggcc ttgttttctg ccccagaagt agccgcctat 7560 

tcctgtttct gttttattcc tttgtttctt gacttttcct ttccaacttg ctctaaaacc 7620 

60 tcagttttct ttcctttctg attcatgact accaaatgtt tccacttgcc tcacccgtcc 7680 
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attacacctt tgataagaac caccagcacc 
gaagaaacat actctctcca tctgtccact 
5 taaaatcact ctcctccagg ctgggcacgg 

gaggctgagg agggcggatc acttgaagtc 
aaaaccaaat cttcaattaa aaccaaatct 

10 

aaacaaaaca acaacaacaa aaacagaaaa 
cctgaggttc cagatacttg ggaggctgaa 
15 aggttgcagt gagccgagat catgccactg 

cccagcacat tgggaggcca aagctgaaga 
ctggccaaca tggtgaaact ccgtctgtac 

20 

gcacacacct gtaatttcag ctacttagga 
gaggcggaag ttgcagtgag ccaagatcgt 
25 tgagattctg tctcaaaaaa aataaaagaa 
ataaataaga cagcagatat actaaggaat 
atttcttttg ggcccacttg gccagctagt 

30 

aatcagagtt taatgacaga gagcgtgaga 
ctcttgagcg agacttctag ggatgggaaa 
35 ccttatcagg ttccagagga tcgtgtttat 
agtgagttgg aagacccagg ggaaatgtct 
cagaacactc tgattcacag ccaaggat cc 

40 

ggaaaaattt ctattctctc cacagcctgc 
agaatatttt gtttctgctg cttctgtcca 
45 gatagactcc tatttttctt gttttatatt 

acatgtagta agcaagattt aactgtttga 
cggtggttga aacttgagtt aaacctttga 

50 

tttttttttt ttagcattct taatagttac 
acccacacag tcacagtcac acacacatat 
55 gcaaatattt taccaccagt caataataca 

tctgtataat tgcctgaatc accagcacat 
aagtaggaag tggggaattt tattaagtta 

60 

aacagggaaa ttataagtgc agagattaac 
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ttgtgctcat 


gtacttgccc 


atgtctgatg 


7740 


ttcctgaggc 


attcaagtct 


agccaccttt 


7800 


tggctcacgc 


ctgtaatctc 


agcactttgt 


7860 


aggagttcaa 


aaccagcctg 


gccaaatggc 


7920 


taaaccaaat 


ctctactaaa 


aaatacaaca 


7980 


ggaaacatta 


gcccagcgtg 


gtggcaggta 


8040 


gcaggagaat 


cgcttgagcc 


caagagatgg 


8100 


caccacagcc 


agggtgacag 


agccatactt 


8160 


ataatttgag 


gtgaggattt 


ggagaccagc 


8220 


taaaaatata 


aaacttagtg 


gggcatgggg 


8280 


ggctgaggca 


ggagaattgc 


ttgaacccgg 


8340 


ggccactgca 


ctccagcctg 


ggtgacatag 


8400 


atttaaaaaa 


tcactctctt 


ccaaagatag 


8460 


aacctcacca 


acttgtcatt 


gactgacatg 


8520 


ctggtttggt 


tttctggaaa 


tgaaagaaat 


8580 


cccagaaaga 


caaaagtaga 


tgaggtaagt 


8640 


tttgtggtga 


ttgatatgaa 


atgatttttc 


8700 


gaagaattaa 


acatatattc 


agctacttac 


8760 


cctcccattg 


atttataaga 


atcacgtgtc 


8820 


agaaggccaa 


ggtcttgtta 


aggggctact 


8880 


tggttttaca 


ttagatttat 


tcgcctgata 


8940 


ccttaatatt 


ctccttctat 


ttgtagatat 


9000 


atgaccacac 


acatctctgc 


tggaaagtca 


9060 


ttataactgt 


gcaaatacag 


aaaaaaccca 


9120 


ccgcttgaaa 


aaatttggtt 


tttagggttt 


9180 


agttgggcat 


gatttgtacc 


atccacccat 


9240 


gtattactta 


cactatatat 


aacttcctat 


9300 


tttttgccaa 


gacatgaagt 


tttataaaga 


9360 


tcactgacat 


gatattattt 


gcagattgac 


9420 


ctcgttgtct 


ggggaggtaa 


ataggttaaa 


9480 


atttcacaaa 


tgtttagtga 


aacatttgtg 


9540 
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aaaaaagaag actaaattaa gacctgagct gaaataaagt gagtggaaat ggaaataatg 9600 

gttatatcta aaacatgtag aaaaagagta actggtagat tttgttaaca aattaaagaa 9660 

taaagttaga caagcaactg gttgactaat acattaagcg tttgagtcta agatgaaagg 9720 

agaacactgg ttatgttgat agaatgataa aaagggtcgg gcgcggaggc tcacgcctgt 9780 

aatcccagcc ctttgggagg ccgaggtggg cagatcacga agtcagtagt ttgagaccag 984 0 

cctggccaac atagtgaaac cccgtctcta ctaaaaatac aaaaaaaaaa ttagctgggt 9900 

9 t 99 t 99 ca 9 tcacctgtag tcccagctac ttgggaggct gaggcaggag aatcgcttca 9960 

acctgggagg cggaggttgc agtgagccga gatcgcacca gtgcactcca gccttggtga 10020 

caatgggaga ctccatctca aaaaaaaaaa aaaaaaaaaa aaaaataaaa atttacaaat 10080 

tttaaaatgg ggggaaaatt 10100 

<210> 75 

<211> 22884 

<212> DNA 

<213> homo sapiens 

<220> 

< 2 2 1 > genomi c_DNA 
<222> (1) . . (22884) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (12238) . . (12393) 

<220> 

< 2 2 1 > coding_region 
<222> (13544) . . (13681) 

<220> 

<221> coding_region 
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<220> 

<221> coding__region 

<222> (18027) . . (18188) 

<220> 

< 2 2 1 > coding_region 

<222> (19594) . . (19695) 



<220> 

<221> coding_region 
<222> (20223) . . (20249) 

<400> 75 

tttttgttct gccatagttt actgagaatg atgtttccaa tttcatccat gtccctacaa 60 

aggatatgaa ctcatcattt tttatggctg catagtattc catggtgtat atgtgccaca 120 

ttttcttaat ccagtctatc attgttggac atttgggttg gttccaagtc tttgctattg 180 

tgaatagtgc cgcaataaac atacgtgtgc atgtgtcttt atagcagcat gatttatact 240 

catttgggta tatacccagt aatgggatgg ctgggtcaaa tggtatttct agttctagat 300 

ccctgaggaa tcgccacact gacttccaca atggttgaac tagtttacag tcccaccaac 360 

agtgtaaaag tgttcctatt tctccgcatc ctctccagca cctgttgttt cctgactttt 420 

taatgattgc cattctaact ggtgtgagat gatatctcat agtggttttg atttgcattt 480 

ctctgatggc cagtgatgat gagcatttct tcatgtgttt tttggctgca taaatgtctt 540 

cttttgagaa gtgtctgttc atgtccttcg cccacttttt gatggggttg tttgtttttt 600 

tcttgtaaat ttgtttgagt tcattgtaga ttctggatat tagccctttg tcagatgagt 660 

aggttgtgaa aattttctcc cattttgtag gttgcctgtt cactctgacg gtagtttctt 720 

ttgctgtgca gaagctcttt agtttaatga gatcccattt gtcaattttg gcttttgttg 780 

ccattgcttt tggtgtttta gtcatgaagt ccttgcccat gcctatgtcc tgaatggtaa 84 0 

tgcctaggtt ttcttctagg gtttttatgg ttttaggtct aacgtttaag tctttaatcc 900 

atcttgaatt aatttttgta taaggtgtaa ggaagggatc cagtttcagc tttctacata 960 

tggctagcca ctttccccag caccatttat taaataggga atcctttccc cattgcttgt 1020 
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ttttctcagg tttgtcaaag atcagatagt 
ctgttctgtt ccattgatct atatctctgt 

5 

ctgtagcctt gtagtatagt ttgaagtcag 
ggcttaggat tgacttggtg atgcaagctc 
10 ttttttccaa ttctgtgaag aaagtcattg 

aaattacctt gggcagtatg gccattttca 
gaatgttctt ccatttgttt gtatcctctt 

15 

ccttgaagag gtccttcacg tcccttgtaa 
aagcaattgt gaatgggagt tcactcatga 
20 ataagaatgc ttgtgatttt tgtacattga 
ttatcagctt aaggagattt tgggctgagt 
catctgcaaa cagggacaat ttgacttcct 

25 

tctcctgcct aattgccctg gccagaactt 
gagggcatcc ctgtcttgtg ccagttttca 
30 gtaagatatt ggctgtgggt atgtcataga 
caatacctaa tttattgaga gattttagca 
ttctgcatct gttgagataa tcatgtggtt 

35 

aaaaaaaacc ctgtcaaaaa gtgggcgaag 
atttatgcag ccaaaaaaca catgaaaaaa 
40 caaatcaaaa ccacaatgag ataccatctc 

tcaggaaaca acaggtgctg gagaggatgt 
tgggactgta aactagttca accattgtgg 

45 

gaactagaaa taccatttga cccagccatc 
aaatcatgct gctataaaga cacatgcata 
50 gcaaagactt ggaaccaacc caaatgtcca 

cacatataca ccatggaata ctatgcagcc 
gggacatgga tgaaattgga aatcatcatt 

55 

aaacaccgca tattctcact cataggtggg 
aaggggaact tcacactctg gggactgttg 
60 taggagatat acctaatgct aaatgacgag 
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tgtagatatg cggcgttatt tctgagggct 108 0 

tttggtacca gtaccatgct gttttggtta 1140 

gtagcatgat gcctccagct ttgttctttt 12 0 0 

ttttttggtg ccatatgaac tttaaagtag 12 6 0 

ataacttgat ggggatggca ttgaatctat 132 0 

cgatattgat tcttcctacc catgagcatg 1380 

ttatttcatt gagcagtggt ttgtagttct 144 0 

gttggattcc taggtatttt attctctttg 1500 

tttggctctc tgtttgtctg ttattggtgt 1560 

ttttgtatcc tgagactttg ctgaagttgc 162 0 

caacagggtt ttctagatat acaatcatgt 1680 

cttttcctaa ttgaataccc tttatgtcct 1740 

ccaacactat gttgaatagg agtggtgaga 1800 

aagggaatgc ttccaatttt tgcccattca 186 0 

tagctcttat tattttgaga tatgtcccat 1920 

tgaaggttgt tgaattttgt caaaggcctt 1980 

tttgtctttg gttatatcaa atttacaaga 2040 

gacatgaaca gacacttctt aaaagaagac 2100 

tgctcaccat cactggccat cagagaaatg 216 0 

acaccagtta gaatggcaat cattaaaaag 222 0 

ggagaaatag gaacactttt acactgttgg 2280 

aagtcagtgt ggcgattcct cagggatcta 234 0 

ccattactgg gtatataccc aaaggactat 24 0 0 

cgtatgttta ttgtggcact attcacaata 2460 

acaatgatag actggattaa gaaaatgtgg 252 0 

ataaaaaatg atgagttcat gtcctttgta 2580 

ctcagtaaac tattgcaaga acaaaaaacc 264 0 

aatggaacaa tgagaacaca tggacacagg 2700 

tggggtgggg ggagggggga gggatagcat 2 76 0 

ttaatgggtg cagcacacca gcatggcaca 2820 
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tgtatacata 


tgtaactaat 


ctgcacattg 


taataataaa 


ataaaaataa 


agtctctctt 


tagaactcaa 


tggttacctt 


tcgtgaggag 


agctgtaata 


tttcacatga 


agcccatctt 


aatcccatta 


aaatggctgt 


caaaaatgaa 


ttacacccta 


aaataaagta 


cataggccac 


aatgcatttg 


ttttttccta 


tcttatttcc 


tacccagtag 


ttctacaatc 


tggcttcaca 


tttgtgttgg 


gctgagtgca 


gtggctcatg 


gcgcgtggat 


tacttgaggc 


caggagttca 


gtctctacta 


aaaaaaatag 


aaaaattaac 


cctactcagg 


aggctgaggc 


acaagaatcg 


gccgagattg 


tgccatggca 


ctccagcctg 


aaaaaaaagt 


tcaccttcat 


gttgggttca 



acttcccaga agattccact aggtagttga 

30 

ctaggctatt tccatgttta ccaataaatg 
cctctaaaac aacagtgcct tcattcaaat 
35 cgtttatttg aaaacacagc atgaggtcaa 

tgttggtttt attttatttg atggctcatt 
gtccgtctta atcttttatt aatcttacaa 

40 

ttcaacttgt gacttttgca acattataaa 
gcaccccaaa gtcaaaaacc ttaggtggtt 
45 gatatataaa ggtactgctt catgaagaat 
agtgtgtatg ggggggtaaa acagacagtg 
ccagcaagtc ttgagaaata tattgctttt 

50 

ctagactata ggtgaattca ctcgaagatt 
gttttgaact tgattattct ttaaggtatt 
55 gatcatgtct ccttgccatt gggaaatatt 
ttctattttt ccctttttaa tttgtttaaa 
cagccataaa aaagaataag atccagtcat 

60 

ttctgtcaag tgcaataagc caggcacaga 
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tgcacatgta 


ccctaaaact 


taaagtataa 


2880 


gccataaaaa 


aaaaaaagaa 


tgtgttcatg 


2940 


aatgcaaatc 


aaggcaaaag 


aagattaaag 


3000 


gcactataga 


aaatgtactt 


atcaggatga 


3060 


atctgtttaa 


ctccaaacta 


agatacaatt 


3120 


cctagtagaa 


gaggtgatct 


ttcttagaga 


3180 


aactatctcg 


ttataaaggt 


cttttagaca 


3240 


ttagcgttac 


cttgggacct 


ttaaaaaatg 


3300 


cctgtaatcc 


tagcaccttg 


ggagtttgag 


3360 


agaccagcct 


ggtcaacatg 


gtgaaaccct 


3420 


tgggcatggt 


ggtgtgcccc 


tgtaatccca 


3480 


cctgaacctg 


ggaggcagag 


gctgcagtga 


3540 


ggcaacagag 


taagactgtc 


tcaaaaaaaa 


3600 


tgacaaatat 


cctgatttaa 


tgtcttataa 


3660 


aagtagaact 


gttgtattag 


cattttctat 


3720 


acatgataaa 


ttcttccagt 


tttcttagga 


3780 


gctaagattg 


gatttctagt 


ccattttcct 


3840 


aacaagaatg 


aatcaaaggt 


agtttctgat 


3900 


atttcaagac 


ctcttggaac 


aacaatcctt 


3960 


ttatagtaac 


taatgccatt 


tcaacgtgat 


4020 


ctggtaaatt 


atgaattaaa 


cacgtttaat 


4080 


aagtgtttta 


agagcttata 


catgtgttaa 


4140 


gtgattttaa 


tatgtacaca 


catgggcatg 


4200 


caagcaataa 


tatttagtaa 


gtagataaga 


4260 


tagtcaaatg 


gggcacttgt 


aaatgaatag 


4320 


tccttttcct 


tttggaaatt 


tgctacttga 


4380 


tcagcactga 


aagctagtca 


gtctctcctt 


4440 


gcatgcaagt 


ggaaaaaagg 


tctatgaatc 


4500 


aaatttttta 


atttatttat 


tttxatutat 


A C <Z f\ 

4 boO 


ttgcaacaac 


atgaatggaa 


ctggagatca 


4620 


aaaacaaaca 


tcacatgttc 


tcacttgtgg 


4680 
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gatctaaaaa ccaaaacaat tgaactcatg 
ggctgggaag ggtagtgggg gcctgggagg 

5 

aatagaataa ataagaccca ctatttgata 
tacattttta agataaagag tgtaattgga 
10 ggggatggat accccattct tcatgatgtg 

tgtctcatgt acctcataag tatatacact 
gccgggcgtg gtggctcatg cctgtaatcc 

15 

cacgaggtca agagatcgag acaatcctgc 
aatacaaaaa ttagctgggc gttgtggtgt 
20 gaggcaggag aattgcttga acccggaggc 
tgcactccag cctggtgaca gagtgagact 
ataattttta tgtatttatt ttactgttcc 

25 

acatatgttc tgcagatcta atgaacagca 
cttgaagttt gttaagagag gagatcttaa 
30 aaagatgatt atgtgatctg atgtgatcag 

caatgcacac atgtgtcaaa acatgttgta 
aatcttcctt tgaaagcagt ttctcacaat 

35 

atcattttta ttatttactt tgaactcatg 
gcattctgtc atttatactt tataccccaa 



45 



55 



gggagaaccg 


aagtcttaga 


gtttaagtaa 


aattcaaaaa 


gagcatgttg 


aaagcatcca 


ctcttcttcg 


ctgaaggcat 


tttcttgtca 


tggttctgga 


taaaacgggg 


agggaaggat 


tccttctgac 


ttctgtctct 


gtctctgtct 


aaatgtagac 


cctggagagc 


cgctgcccta 


cacctgtgca 


gaaattttct 


ggcaactggg 


cccacacact 


atgttttggt 


cttggtaact 


cccactgacg 


ctcctgtttg 


taggactagg 


tctgtctcac 


agactcccct 


ggggcatgga 


ggcctggccc 


tgtgaccttc 


ataaggattg 



aagacagaaa 


gtagaaggat 


ggttaccaga 


4740 


ggaggtgggg 


atggttaatg 


ggtacaaaaa 


4800 


gcaaaacagg 


gtgactgtag 


tcaataactg 


4860 


ttgtttgcaa 


tgcaatggat 


aactgcttaa 


4920 


agtattatgc 


atttcatgcc 


tgtatcaaaa 


4980 


aggtacttac 


aaaaatggaa 


aaaatagggg 


5040 


cagcactttg 


ggaggccgag 


gcaggtggat 


5100 


ccaacgtggt 


gaaaccccat 


ctctactgaa 


5160 


gtgcctgtag 


tcccagctac 


tcgggagtct 


5220 


ggagcttgca 


gtgagctgag 


atcgcgccac 


5280 


ctgtctcaaa 


taaatacata 


aaaaataaaa 


5340 


cccctcagct 


ttaatgaggt 


acagtcagat 


5400 


tgataactga 


agttaatact 


gtactatgta 


5460 


atgttctcac 


caccaccacc 


aaaaaaaaaa 


5520 


tagattatct 


gctttgtgtt 


agtgacttca 


5580 


catggtaagt 


atattaaatt 


tttatttgtg 


5640 


cattctctaa 


ggaattgaag 


aaagagctca 


5700 


tgaagatctt 


agttattaag 


acctttacat 


5760 


agtgggcact 


atgatcttca 


acttacaaat 


5820 


cttcctttca 


tggctttttc 


tacagctcca 


5880 


ctctttcctt 


tcatttcttc 


agatttcttt 


5940 


tctctttctt 


cttcagtatt 


ccatttgcaa 


6000 


cttatcttta 


ccaccttcct 


tttggctatt 


6060 


tctctttcat 


cagtcccttc 


cttcgctcct 


6120 


gggctcaggt 


gccagctctc 


ttagcagaag 


6180 


cccctgctca 


tttcatattc 


acaatctctg 


6240 


ggccctcttt 


ctctttcctg 


tgatcacgtc 


6300 


acctcgcttt 


ccttgctgtg 


aacattcttc 


6360 


tcaggggctg 


tgccctcatt 


ctttctctca 


6420 


tggtttgaat 


cttcacgtta 


cctacttcct 


6480 



-106- 



WO 02/062946 



10 



20 



tactccttgt 


cacttggctc 


tgatctttct 


aagcatataa 


tttggccact 


tttgaaactg 


gtaatcccag 


cactttggga 


ggctgaggca 


ccagcatgac 


caatacggtg 


aaaccctgtc 


tggtggtgca 


tgcctgtagt 


tgcagctatt 


cccgggaggc 


ggaggttgca 


gtaagccaag 


agagtgagac 


gccatctcaa 


taaacaaacg 


aaaaccaaaa 


aacaaaaaag 


aaactgaaag 


aggcacactt 


cgggcttcac 


cattgtacaa 


taaattttag 


gaaagctggg 


taacatttct 


gttaatagta 


gtacctgggt 


cctgggttat 


agcattcaga 


acaatgacct 


gtatgtagta 


attattatta 


tatctgtgat 


attgggcaaa 


gaatgtcatc 


acgcatataa 


ggtacatagc 


gcctgagtga 


ccacctccct 


cctccagatg 



30 

tgctatgcac ctaacactgt tctacctcca 
tattgttgct tgtgtttcct gttccagcaa 
35 cagttatccc tcacaccagt ttcctaatca 

agcgaggccc caaatgcaga tgtgattagc 
caccaagtcc tctaaagtgt tgcataaggc 

40 

gttatcggga aaagatgctg tagtgatctt 
gacttgggag gaaagccgtc tgccaaagcc 
45 atagacattc ccctttcttt tatatctttc 

atgtctgcaa aggtctctgt ttggttgcta 
atggcttctc tttaggatgt gatgattcct 

50 

ataatacagc aatttctttg ggaaaatgtg 
tcttaggtac attaaatccc actggctgca 
55 ggggtggtct cgatatgcct gaagtatggg 

gggacagtgg agtgtaatga tagcttgagg 
tgtgtgggct tgggaaagct acttaacttc 

60 

ggaaaggatc atagcagtag tacccattca 
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attgcctatt 


ttaggacatt 


tatttaggat 


6540 


aaaggaggcc 


gggcgcgatg 


gctcatgcct 


6600 


ggtgaatcac 


cggaggtcag 


gagttcgaga 


6660 


tctactaaaa 


atacgaaaat 


tagctgggag 


6720 


caggaggctg 


gaacaggaga 


attgcttgaa 


6780 


attgcaccat 


tgcattccag 


cctggatgac 


6840 


aaacaaacaa 


acaaatacaa 


aaaacctaaa 


6900 


gaggggtttt 


aacaatcata 


ttaggataac 


6960 


aacagagatt 


ttgcacaggg 


cttttccagt 


7020 


ataccatagt 


tttctctacc 


ataacatcgg 


7080 


tgtgaggatg 


agatgaatta 


acatatgtaa 


7140 


accatttaat 


acatttaaac 


aataattagc 


7200 


gtaattaacc 


tctctaagca 


tgtaagaatt 


7260 


accgttgctg 


gcacatcatg 


taaaataagt 


7320 


cttctcttac 


agtgtgactt 


tgtctcttct 


7380 


acccacaacc 


tcttcccact 


tgtctaccca 


7440 


tgtgttaagc 


aacatctgaa 


agccagaaga 


7500 


tcactaagcc 


ctggggaaac 


acaatggatc 


7560 


acactggtca 


gtcagcccgc 


aaaacacttt 


7620 


aaagataacg 


ttaagtctgt 


ggaggcttcc 


7680 


ttctgagtgt 


ctcctacttg 


cgacaaggtg 


7740 


tgaagcctcc 


aaggtataga 


ttccctctcc 


7800 


gttaaaatgc 


ctcccagtgg 


actactgggg 


7860 


tgtagaggtt 


tggaacattt 


aactgtgaac 


7920 


ttggagccat 


gataaaaagt 


gttagacata 


7980 


ggtatggata 


cacacacatg 


cacacgcacc 


8040 


cctgagtttg 


tctgccagct 


ctgataatga 


8100 


gtctgcaaag 


tgcattttag 


catagctcag 


8160 


ttctgatttt 


tagcttctca 


aattgctggc 


8220 


tttgttcctc 


agcttcttca 


tctgttagat 


8280 


tctgatgttg 


tgagaactaa 


attagtaaac 


8340 



-107- 



WO 02/062946 



PCT/US01/48437 





atggataaag 


cccttagaac 


aaatgcctgc 


5 


actggcagtt 


tcctagaaaa 


tctacatttc 




agagagacag 


agtctcgctc 


tgtcacccag 




ctgcaaactc 


cgcctcatgg 


gttcaaacaa 


10 


gattacaggc 


qcqtqctqtc 

ZD ZD ZD *-ZJ 


acaaccagct 




ttcaccatgt 


tagcaaggat 


gatctcaatc 


15 


tcccaaagtg 


ctgggattac 


aggcatgagc 




ttaaaaataa 


ataactgatt 


taaccaaatt 




ctgaaaattc 


tccctgatta 


ttttgattat 


20 


aqatatttgc 


tctcttcaga 


aatgctcttg 




tgcagaggta 


ttttccatca 


tcatttatca 


25 


tggttactga 


ttgaacatgt 


gtgttaactg 




gcactggagt 


tgcagtgatg 


agtagacaac 




agctcaggct 


gattaagtgg 


tatgagaagg 


30 


aqqqqtqcat 


gcttcttcca 


qaqttqqcca 




atgtgggctg 


aaaatcagtc 


tggaagggcc 


35 


cactttggga 


ggctgaggtg 


ggtggatcac 




aacatggtaa 


aaccccatct 


ctactaaaag 




gcctgtaatc 


ccagctactt 


gggaggctga 


40 


qaqqttqcaq 


tgagctgaga 


tcgggcctct 




ccaactcaaa 


aaaaaaaaaa 


annnnnnnnn 


45 


nnnnnnnnnn 


nnnnnnnnnn 


nttcagtctg 




ttgaggggac 


ttgtctttca 


ggaatggggg 




tttagaactt 


ctgaaggaag 


acaaatcaag 


50 


qqatqqqtqa 

ZJ~> ZDZJZD w ^ 


qqqqtqaqqc 


aggagacgta 




cattctccgt 


attgcatgca 


tcttagttca 


55 


ttcccagtgg 


ataaatttga 


agtcttggct 




gataattacg 


ttaggaactt 


attcacatag 




cgctcttggg 


ttctgtacag 


tgttgaacac 


60 


tttatccata 


cttaaataca 


tacagaatgt 



cacatactaa gagctggaga gatgaactta 84 00 

tctgattttg tgtgtgtgtg tgtgtgtgag 8460 

gctggagtgc aatggcacga tttcggctca 852 0 

ttctcctgcc tcagcctcct gagtagctgg 8580 

aattttatgt gtttttagta gaaacagggt 8640 

tcctgacctc gagatcctcc cgcctctgcc 8700 

cactgcaccc ggaccatttc tctgattttt 8760 

ttagaaaaca tagtttgttt cacctggagg 8820 

ttgaaacaaa tcaagttaaa taatatggtt 8880 

gagtttcttg agatgagagg ctccagtgct 8 94 0 

ggatggctac cttgggctgg accaaaagtc 9000 

agtgcttgcc atctgatgat tctgagctag 9060 

gaggtaacag acaagtatcc aggcaattaa 912 0 

aaagttgagg aaattttgag agcaacccaa 9180 

ggactaaaaa cagcatcctg gaagtatgac 9240 

gggcgcattg actcacgcct gtaatcccag 9300 

ctaggtcaag agttcgagac cagcctggcc 93 6 0 

tacaaaaatt agctgggtgt ggtggtgcac 942 0 

ggcgg<?atca tagcttgagc ccaggaggcg 94 8 0 

gcactccagc ctgggagaca gagcaaagct 954 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960 0 

gaaggagagc agaaattatc caggtgaagc 966 0 

atagcacaga ggattgaaga catttgaggg 972 0 

aaaataaggc tggatggagt tttgagatag 97 8 0 

ggcgagagat tatctggtca taaagcagca 984 0 

ttagatattt aatccaaaat acacacaaat 9900 

ttttggctat tcttatttat actttttgca 9960 

agtttgcttt aagctttatt gatcttccct 10020 

ggggggttaa gggttccttg tcatttcaca 10080 

aatttaatct tggcatgaaa atcaatattg 1014 0 
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gtaggaaact ttgatgaaga gtattggtag 
ttatcagtat tggtagagaa ttttatgaat 
5 acatggcttg actctcactt ttagtcaatg 

gtcatacaca cccccagagt agagtatcag 
ttagcttgac aaaaaattaa gttattagaa 

10 

gagtagaaat taagatacta tactttcttt 
gggtacatgt acaggttcgt tatataggta 
15 ttgtcaccca ggtactaagc ctagtactca 

ctctctgata atctccagtt gttttgttcc 
agctcccact tataagtgag aatatgtggt 

20 

taaggataat ggcttctact tcatccatgt 
tatggctgca tagtattcca cagtgtgtat 
25 tgatggacat ttaggttgat tctatgtctt 

tcacgtgcat gtgttttatg gtagaatgat 
gatagttatg gcagaataca gcatcatttt 

30 

tctccctcat cagttataaa agcatataag 
ctatgaatta ggaagttatc ttgcccaaca 
35 tactttagtg atacatattt ttcagctgta 

tatatataca aacttgaagt cctttttgtt 
ccacctgttc acaggttgag aattccgtac 

40 

ataaaattaa aattattccc atttttacag 
catgcaacta gtaagtgatg aaatctggat 
45 tctgaaaact gaggttccca ttactaggtc 

tgaatttgga tcttccatag tgcttaaact 
gtcttctata tagtactata taatgtggtg 

50 

tgtttttagt aaaatctgaa atgatttata 
ctctctatac ctcaattttt ttcatctgtg 
55 attaagtata ttaatacatt cacagcatcc 

taaataaatg acacctacta ttattactgg 
agaccactgc tcagagaaac catatgtctg 

60 

ggactatata tcagtgcatg caaaatgaga 
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caaaatttta 


tgacatttat 


aataaatgtt 


10200 


ttttatagca 


attcccagtg 


taataataaa 


10260 


cttagtaaat 


ttgtcatcac 


acttaaatga 


10320 


aaaggtaaac 


atcctgaaag 


tggcgtgtat 


10380 


aaatggatct 


agtatataat 


ttgtaacaaa 


10440 


ttcaataaaa 


agcttttatt 


ttaggtttac 


10500 


aattcgtgcc 


acaggggtgt 


gttgtgcaga 


10560 


atagttattt 


ttcccgatct 


cctccctcct 


10620 


cctctatgtg 


tccatttgtt 


cttagcattt 


10680 


atttggtttt 


ctgttcctgt 


attagctttc 


10740 


tcctgcaaaa 


gacatgatct 


cattcctttt 


10800 


gtaccacatt 


ttcttcatcc 


agtctacaaa 


10860 


tgctattgtg 


aatagtgctg 


caatgaacat 


10920 


ttatattaag 


atcttattct 


taacccaatc 


10980 


atgtgtttgt 


ttatggttca 


tctcatgcca 


11040 


cattttacaa 


ctattttgtg 


taggagggta 


11100 


ttcagaagtc 


caatgtaagc 


attctgtgtg 


11160 


atgattgagg 


tggttgtttt 


gtgaccctta 


11220 


agaaggcaga 


gtataagtta 


aaagctgata 


11280 


attatcaaag 


tcaactctcg 


caatgttaat 


11340 


aagtgaaaac 


tgaggttccc 


actacaaggt 


11400 


ttgaattgga 


atcttccata 


gtgcttaaac 


11460 


atgcaactag 


taaatgatga 


aatctggatt 


11520 


ctaccatgct 


atgttggtac 


tcagaaagac 


11580 


atttgaaaaa 


tattggacct 


tagagtagaa 


11640 


agccatatga 


gtttgggtaa 


gtttctcggt 


11700 


aaacgggaat 


aataataata 


ttataacagg 


11760 


agacagtgct 


tggcatatag 


taataagtac 


11820 


aatgtgacta 


aagccat tat 


tcttataaag 


11880 


gttgcccttg 


actgacttga 


agaaaggcag 


11940 


aagaaattgc 


tctttaccat 


tccaaagtta 


12000 
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agtgaagcac 


caaggcctac 


tacaatgacc 


cgatgatgtc 


tactactcca 


aggcaagaca 


12060 


gaattgaata 


ttgtacaaac 


tctgttactc 


tgcctggtgt 


cccatttgtg 


gagagtcatt 


12120 


gtaatgtatg 


gacagaattg 


aacccttctg 


taacactcaa 


aattttagta 


gctagtggaa 


12180 


aaagggaaag 


catgaaggct 


ttggatttct 


tttctatcac 


agccataaac 


aaccccaatg 


12240 


gcctcccacg 


aagttgataa 


tgcagagctg 


gggtcagcct 


ctgcccatgg 


taccccaggc 


12300 


agtgagacgg 


gaccagaaga 


gctgaatact 


tctgtctacc 


accccataaa 


tggatcacca 


12360 


gattatcaga 


aagcaaaatt 


acaagttctt 


gggtaagtca 


gccttagttt 


aaacactgat 


12420 


ttaagaggga 


agaaaataaa 


tacagttgac 


ctgtgaataa 


cgtgcgtttg 


aactgtgagg 


12480 


attcccttac 


atgtggattt 


cttccacctc 


tgtttgccat 


cccgagacag 


caagaacaac 


12540 


ccctgctctc 


cctcttcttc 


ctcagtctat 


tcaacgtgaa 


gaggatgagg 


atgaagacct 


12600 


ttatgataat 


ctacttccac 


ttaatgaata 


gtaaatacat 


tttctctttc 


ttatgatttc 


12660 


cttaacgttt 


ttagtttctc 


tagcttattg 


tatataatat 


gtcacataca 


aaatatgtga 


12720 


taatcgactg 


tttatattaa 


tagtaagact 


tctggtcaac 


agtaggctat 


tagtagttaa 


12780 


gtttttggga 


agtcataagt 


tttacgtgga 


tttaccacca 


tgaaagggtg 


ggcattccaa 


12840 


accccaatgt 


tgttcaagag 


ccaactgtat 


catgtagaaa 


accttgctta 


tgctgagaag 


12900 


gaggaaattc 


tgagttattt 


atcactgact 


taataatgat 


tggagaaaag 


acagggactt 


12960 


cttatagatc 


tggggagcag 


aaatgcaaag 


gcattttctc 


atggtagaaa 


acatcaaacc 


13020 


ttggagaaca 


gattccaaga 


cccttcattt 


tttgttttgt 


gccttttgac 


cagccatttg 


13080 


aggaggctga 


gctctatgaa 


gcttatattc 


tatattttaa 


atttaaaaat 


tgggttgaga 


13140 


gaagggttaa 


aaattgggtt 


gagagaaagg 


ctgagtagga 


tatggcttat 


cctctgtctg 


13200 


ccttcatcat 


attatgatgc 


aaaagcatgc 


tgggaagaga 


ctttgactcc 


atctacctta 


13260 


ccaatcatgt 


ctgttttact 


gtcctgccac 


tttctgagag 


gagttcgtga 


gaatgaacag 


13320 


aaatgtgggg 


gagaggtgaa 


gacaaaatgc 


tgccactgga 


attggtgtgg 


atagcgtgga 


13380 


ataaaaaact 


ctttcagata 


agctctcgtt 


ctgttaattt 


aggcttgaca 


ctttaaccat 


13440 


ttcaagcacc 


gacatctcct 


ttctgaggaa 


aattaaaatg 


gcctgttttc 


agtgtccact 


13500 


ttagtatatg 


actgttacgc 


ctttttcccc 


tttcttcttt 


caggccatcc 


agatcctgaa 


13560 


tgcagcaatg 


attctggctt 


tgggtgtctt 


tctgggttcc 


ttgcaatacc 


cataccactt 


13620 


ccaaaagcac 


ttctttttct 


tcaccttcta 


cacaggctac 


ccgatttggg 


gtgctgtgtt 


13680 


tgtgagtatt 


cgtactcccc 


tggctatatg 


ttatttctta 


gataccactg 


ctggagatgt 


13740 


gtttgatgac 


aaaaatattg 


tttgtaggtt 


attggaaggt 


tgcatgctgt 


ggttttgatt 


13800 
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tgcaattctt gatataaaca attagagacg 
aattacttaa ttaatttatt ttttgagaca 
5 cagtggcatg atctctgccc actgaaaccc 

cctcagtctc ctgagtggct gggattacag 
tttttgtatt tttagtagag acagggattc 

10 

tgaccacaag tggtcctccc agcttggcct 
cctgatgcca ggccaaaaat ctctttctga 
15 tgaagtccta aaaattatct tatatataac 

ttgaatgaaa aatgggtatt gatgaattgc 
gagttaagaa atttgtctgc tgattactgt 

20 

tgggtcaaca ttcaatcaga agaaataggg 
attatttatt acttccctga tggttcattt 
25 tttgtatatc ttcttttgag aattgtctat 
tgtttgttta tttcttggtg atttgtttgg 
tgttagatgt atagattgtg aagattttct 

30 

tgactgttcc ttttgccgtg caaaagctcc 
ttgattttat tgcatttgtt tttggcttct 
35 cctaagtgaa gtaactcagg agtggaaaat 

gagctaagct atgaggatgc aaaggcataa 
gggaaaggtg ggaaggaggt gagggataaa 

40 

ttgggtgatg ggtgcaccaa aatctcacaa 
aaacaccacc tgttccccaa taacgtatgg 
45 taaaactaat tgcatggaaa gactaaataa 

tgagcagcat caagtacaat tctgtgtcct 
gactattgct ggatatctgc taattgaggg 

50 

tattccttta tctgttttcc ttcctatttt 
tgttgtagca gggataaaac ccacaagaac 
55 gatcagagga gtagaaatat attattcata 

aactgagaca ctgagtatca tttaatgcac 
ggtaggtcag cagatcacat acattgtgag 

60 

ttggtgaact tgaagcttag aaaaagtctt 
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ataaactctt 


ttttattttt 


taatttaatt 


13860 


aagtctcgat 


ctgccatcca 


ggctggagtg 


13920 


tctgcctccc 


cagttcaagt 


gattctcttg 


13980 


gcatgcacca 


ccatgcccac 


ctaatttttg 


14040 


accatgtttt 


ccaggatggt 


ctcgaactcc 


14100 


cccaaaatgc 


tggaattgcc 


aggcatgagc 


14160 


ttaagagaaa 


ccacatttgc 


caagtacaaa 


14220 


tatatgttac 


atttatagat 


acttaaaaaa 


14280 


cttgtggtac 


agcagagagt 


cctggcttga 


14340 


ctgaaagtca 


gaagaatcat 


ttccattatc 


14400 


gtgattttaa 


aatgtattca 


ttcagttttg 


14460 


ttaaaaagtg 


aatgatatgt 


ttgttggcca 


14520 


tcatgtcctt 


agcccatttt 


ttgatgggat 


14580 


gttcattgca 


gattctggat 


attagtcctt 


14640 


cccactctat 


gggttgtctg 


tttactctgc 


14700 


ttagtttaat 


taggtcccag 


ctatttatct 


14760 


tcgtcattaa 


atccttgcat 


aagccattat 


14820 


caaacatcat 


atgttctcac 


ttataagcgg 


14880 


gaatgacaca 


atggactttg 


gggactcagg 


14940 


aggctacaaa 


ttgtgtgcag 


tgtatactgt 


15000 


attaccacta 


aagaacttat 


ccatgtaacc 


15060 


aaataaaaaa 


aatttaaaaa 


ataaaatgaa 


15120 


ataaataaat 


aaataaataa 


ataaataaag 


15180 


aggatgtgaa 


tttaaaaggg 


aggtttgtat 


15240 


aggaccaaga 


agaaaatggt 


tgagacttaa 


15300 


caaacagttc 


tgtagttcag 


gaaccttgtc 


15360 


atgggtaagt 


agcacttcct 


ctttttctat 


15420 


aacagtagga 


cagtaagtgt 


ccatgaatac 


15480 


aaaatattag 


ttttcttaac 


gtgttcagat 


15540 


gatgcattcc 


tgggatgctt 


gctcttaata 


15600 


cttgattttt 


atgtgttgtc 


aagtttgacc 


15660 
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aattctaatt 


atttggaagt 


aagaactccc 


5 


aaacttctat 


tctaaacaac 


tgctaagtct 


cactgatact 


ccactttcct 


actccattca 




ttaccatttt 


attgtatcct 


caaacaccaa 


10 


cttyaL L. L L- LL 




y ciy clcxcx o. ^ l. y 




actactaatg 


gactagatct 


gtaccaatgt 


15 


aaattttgtg 


attttcttta 


aagctgatct 




tttccccaca 


caagtgcatc 


actttaattt 




gtctcactct 


gtcacccaga 


tgggagtgca 


20 




y ^ a. v ci cs. 


trrtrrrarr 

V_» \_» V_k \_* Vj» v^- 




gtgtgccacc 


atccctggct 


aatttttgtc 


25 


tcttgccatg 


ttgcccaggc 


tggtctcaaa 




gcctcccaaa 


aggctggggt 


tacaggcgtg 




taaacatcca 


gttactgttc 


tcaacttaaa 


30 


' ^ 1 — 1 — d vQ U I. 


afttcfrrct 


cr c 1~ a cr aa a a a 

y y ^ 




aactccttca 


aaggaatcat 


catgttcttt 


35 


acccacttca 


ataagacttt 


attttattcc 




gatcactgaa 


aacttccacc 


ctgctaaaac 




aagctatgag 


tagcatttag 


catagttagt 


40 


cttggcttcc 


agcatacatt 


gcttggctgg 




cagtctcctt 


tcctggttcc 


tctccttgtc 


45 


gctgaattct 


ttgccacctg 


ctcttctcaa 


attcttatga 


cttcaagtaa 


tatctgtatg 




caggtgtctc 


tcctgatcat 


cagactggca 


50 


tcttcctcaa 


accttctcct 


catgtcagct 




accaaacacc 


ttagaataac 


caagagtctt 


55 


cagcaaataa 


tgttggctct 


actttcaagc 


atctttctat 


tgatgacatt 


tcatccaatc 




ttttatcgtt 


gaacttctct 


ttgccttgct 


60 


caagtatatc 


atttatttgc 


tccaaatcct 



accactaact 


agtttgacaa 


ctgaaagtcc 


15720 


ctatgatgtg 


ttcttagtga 


gtcacgtagc 


15780 


ttcactttcc 


ctcctccatg 


gatagctatt 


15840 


tcccacttct 


gtcatcctta 


ctctctgctg 


15900 


aagccattag 


aagggaattt 


tcagacctcc 


15960 


atttttttct 


tctagcctct 


taccatagga 


16020 


ccctacttga 


gtataagatt 


ccattctttt 


16080 


ttgtctctct 


gctatatata 


tatttttcga 


16140 


gtggtgtggt 


cacagctcac 


tacagcctcc 


16200 


tcagcctctc 


acatagctgg 


gactacaggt 


16260 


tttttttttt 


ttttttttgt 


agagacagga 


16320 


atcttgggct 


caagcagtcc 


tcctgcctca 


16380 


agttaccttg 


ccccatctct 


aatcagcatg 


16440 


aacaaaacaa 


aacaaaacaa 


aacaagaaaa 


16500 


ggcacaattt 


gtattttcct 


tctgtagcta 


16560 


ctctgatttc 


catctaccgg 


tttgaatcaa 


16620 


ctactatcct 


tctgtaactg 


ctgtctttaa 


16680 


aacggtcaac 


tcttgttcct 


catctaactt 


16740 


cgcccactcc 


tccttccagg 


aatttcctca 


16800 


ttttcctccc 


atcttttgct 


ttctccatct 


16860 


tctgaccttt 


taatgttctg 


gtgccccaga 


16920 


tctatacgta 


cttcttctgt 


gatctcacac 


16980 


cccacaaccc 


tgaaataagt 


atttccagcc 


17040 


tgtaaagatc 


caaagctcaa 


ctcccagtct 


17100 


gatgggaatg 


tcatccttcc 


agattcccag 


17160 


ctttctttca 


caatcctgaa 


tcctatttat 


17220 


tattattata 


tccagttttg 


accttttctt 


17280 


caccttgtct 


ctcctaattg 


ggcttctagc 


17340 


agaaacagtg 


atcccttgaa 


accataaagc 


17400 


tcattggctt 


tgtatttttc 


ttagagttag 


17460 



-112- 



WO 02/062946 PCT/US01/48437 

aaacaagttc cttttcatga tctgcagagc cccacatgat ttggctctct gttaactttc 1752 0 

cttcttcttc atttcctagt attgactacg tgcctccaac gcccttgacc cctttgctgc 17580 

5 cccagaagat tctggcctca ggattttgag tgtttcccta atttcaagga tattccccca 17640 

gatatcctcc tggctggcac ccacagctct ttcatgcctc tccccagtgt catttctatt 17700 

taatatcatg aacccccact cctctctttg gattcccaga ttctccttat cccattctgt 17760 

10 

tagcccctag tacttaatca ctttcaaaaa tattatgtaa tttacctata gtatgtttac 17820 

ttttttcttt gtttttattg ctagaatata agctccatgg agggtggggg acagaaatct 17880 

15 ttctttggtc attgatgtta gcctcgcatc taggatatca gtgggtgtgt aaacaatatt 17940 

caatcaatgt ttgtttcaat gaattgttgc caaaggaatg aagtacgtgc tgcatatatt 18000 

aattgaaaat tcttattctt ttttagatac agaacagttt tggaatgaac attgccagtg 18060 

20 

ctacaattgc actagtgggg actgcttttc tctcactaaa tatagcagtt aatatccagt 18120 

cattaaggag ttgtcactct tcatcagagt caccggacct atgcaattac atgggctcca 18180 

25 tatcaaatgt atgtttctga aaaatatgta taagtataaa atatttcaaa caatccagga 18240 

tgtacagaaa aataatatat caggtaccaa tgtgccataa tttgaatctt tttttttttt 18300 

ttttttgaga cggagtctcg ctctgtcgcc caggctggag tacagtggag tgatctcggc 18360 

30 

tcactgcaag ctccgccctc ctgggttcac agcattctcc tgcctcagcc tcnnnnnnnn 1842 0 

nnnnnnnnnn nnnnnntttt tttttgtatg tttagtagag acggggtttc actgtgttag 18480 

35 ccaggatggt ctcgatctcc tgacctcgtg atccgcccgc ctctgcctcc caaagtgctg 18540 

ggattacagg catgagccac cgcgcccagc caaccttacc aattttttaa aaatttgtgt 18600 

tcaggtactc tacctcttct gaactctact tagaaattaa tattttcctt tggaattatc 18660 

40 

catttcatcc cagtgatcta tgaactgtta attacagact cactcctaag cttcctgttt 18720 

tctttttatt attattatta tacttgaagt tctagggtac acgtgcacaa cgttcaggtt 18780 

45 tgttacatat gtatacatgt gtcgtgtttg tttgctgcat ctattaacat gtcatttaca 18840 

ttaggtattt ctgttaatgc tatccctccc ccagcccccc accccacgac aggccctggt 18900 

gtgtgatgtt ccccgccttg cgtccaagtg ttctcatagt tcagttccca cctatgggtg 18960 

50 

agacactact tatggaagtt tgtgttctca ttttctttat tacattttta atgatctaaa 19020 

tttatatttg tcggagcctg atttttctgt gaatttggag gtatttttcc tccagggata 19080 

55 tattgtattg gttgcttcta ggtgcctcag taataatagt agaatcatta gcccaacaat 19140 

tttatttttt gtttatatta gctcagtgtt ttagtttcta gggataatgt aaatgcctaa 19200 

acccctatga agtgctaatg aaattgtgga agggccattt ttgttcattt ttctaaactg 19260 

60 

cctactcctc tccttagttt actactattc agattggaag atgttagtga taatgtcaga 19320 
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attttgtttc 


tacagcttct 


tccccctgta 


gctacttcac 


atgcgttgac 


atagaaatgt 


19380 


gacacatctt 


tctcattaga 


gaaaatgttt 


tcatgctttt 


agattgagat 


actatctctt 


19440 


ctacagtgtt 


atatgtgtct 


ctgtctttaa 


tttatggttc 


actttgttcc 


actttacggg 


19500 


agggagatgt 


tgtagtctgg 


tttcctgctg 


acattttttt 


ttttacctct 


ggatgttgcc 


19560 


actgggtttg 


atataaggca 


tcatatcccc 


tagggcatgg 


tgtctctact 


gctgattctc 


19620 


accttgctgg 


aattatgcgt 


aactatctct 


accatagcca 


tgtggtgcaa 


tgcaaactgc 


19680 


tgtaattcaa 


gagagtgaga 


tttttcaaat 


gattaatcat 


tcacatgatc 


tcaaagcctc 


19740 


cctgtagggt 


ttgaagttgt 


ttctgatgct 


tagaaataaa 


cctttatgtg 


gtaatgattt 


19800 


gtggagttcc 


tggaattaga 


gattgtatga 


atgttagaga 


tggaatgaag 


ggtagaaaag 


19860 


aatttggatg 


tgaacatgca 


agaggaaata 


ctgtacttga 


tatataagag 


catactttct 


19920 


tatacagctg 


taattaagtt 


ttattatttt 


cttataactt 


gtaactttct 


aattgtatga 


19980 


cctaggaaaa 


attaataatt 


ctttctaaat 


tttagatata 


ttaactgtaa 


aatgaaaata 


20040 


atattttcat 


tctcttctaa 


tgtttgttgt 


gagtactgtg 


tatataatct 


ttatggcatt 


20100 


tggcacagta 


aacatcaatg 


aatggatttt 


attgttgatg 


attatgatga 


taacaggaag 


20160 


ggatgggagg 


agaaggagat 


aatgatcctg 


ttcttggaca 


ttaatgttgt 


tttattttct 


20220 


aggaaatttc 


ctcacctccc 


aattctgtgt 


aatcaagaat 


acctccttat 


gaaaataatt 


20280 


ctgagagcat 


gaatatttga 


ccttaaatct 


ccagtgactc 


agagcttcac 


ccacaaactc 


20340 


aggagaacat 


aagcctgctc 


gtaaagctca 


atccttctat 


catggcacca 


atcacaagaa 


20400 


ccttggacgt 


ttgactgact 


ctatcctttc 


tctcctaact 


ataaatccta 


tttgtgtgtc 


20460 


gtgggtatgg 


aaggacagat 


atatttcttt 


aggcattctt 


ggatatctgt 


aacttctatg 


20520 


atcattactc 


caaagttgtt 


tccagaaatt 


ggttctattt 


cttcttatcc 


acctactcca 


20580 


ttgctttatg 


aggtttaagg 


aaggaaggcg 


gtataatccc 


tattcaatat 


attttttcta 


20640 


aaatccaact 


tctgaccgcc 


cagtaggaag 


aaaaatgaga 


cattttttcc 


attacagaga 


20700 


aatgcttctt 


gactttaaca 


tcagcattat 


aaaaagtgtc 


aaataaaaaa 


ttaccatcat 


20760 


tatcattaaa 


ataaattttc 


actgtatttg 


agatgggagg 


gttaaggctc 


agggatttta 


20820 


tttcagtgaa 


ctgctggaac 


tcacacatgc 


cctgatatgt 


aaatgatgat 


ttatgttggc 


20880 


gagtctgaga 


gcaagcccaa 


atgtgttctt 


caaaggacaa 


tgggaaactg 


taaagtagag 


20940 


aactaaagaa 


taaggccttt 


agaatctgac 


acatctgggt 


tcaaattctg 


aaactgtcac 


21000 


ttattacctg 


tatgaacatg 


ggcaaattat 


ctaatctctc 


tgatctattt 


ttcctcatct 


21060 


gtaaaatagg 


tgtaataata 


acaactactt 


tgtcggttgc 


tctgagggtt 


aaatgaaaat 


21120 
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aaaaagaaaa 


tgtgaaacag 


caccacaggt 




tcttgctgtt 


gctggtgaaa 


gctaaggtgg 


5 


gtggaaaact 


tacttacact 


tccattgtgg 




gccatctttt 


cgaggtggca 


atgtggcgtg 


10 


ctgctaaagc 


acttatattt 


cttattcata 




agaaatataa 


gcacttattg 


tgtccataaa 




gtatatcgta 


tgtgcaaagg 


ttcttttcta 


15 


ttctttattt 


tttaatccat 


tctgcagcgg 




ttgaggagga 


attatttaat 


tatttaggtg 


20 


actgagcccc 


aaacaaagag 


tccggttacc 




ctgtgcaggg 


ggaagcatat 


tacagaagtg 




tgtattacat 


tatttcttat 


tttttcaagg 


25 


gcctagtgac 


cttgcacctg 


cacagctaga 




ggagagataa 


ggcttactag 


ccgcagaaaa 


30 


ctttctcttc 


ttcaggggga 


attgggtttc 




attctttaat 


ttcttttaat 


tcctgttcta 




tgtgtcacgt 


attgtgctaa 


gagcttggca 


35 


cttaagttgc 


ttgtagtggg 


agatacagac 




agagtatagg 


aatcttgaga 


agcacgtagg 


40 


tcagtgataa 


ttttttagaa 


cacttttctt 




ttaacatgat 


ggaaggcaaa 


tggaagaaaa 




aaaattttga 


atattcaaag 


aataaactag 


45 


atgggcagat 


ggaaaaataa 


aagtttggag 




gacagcactg 


tgagtttttt 


tttatttttt 


50 


ttattgatca 


ttcttgggtg 


tttctcacag 




gtggagggaa 


ggtcagcaga 


taaacaagtg 




gaccctgcgg 


ccttccgagg 


tgtttgtgtc 


55 


gactcttaac 


gagcatgctg 


ccttcaagca 




cttaatccat 


tgaaccctga 


gtggacagag 


60 


taaggtcata 


gatcaacagc 


atcc 



PCT/US01/48437 



acttgacaat 


gatgttgctg 


ttgtggttgg 


21180 


gattgtggtc 


agatgtgctg 


gagagataca 


21240 


ctcctcgtat 


aacctccaac 


aactccaata 


21300 


aaaacttccc 


tcagttcctc 


gtttataaat 


21360 


aatatccaag 


aaaaataaaa 


tttatgaata 


21420 


tttataccaa 


taatttattc 


ttagttacgt 


21480 


acacttgttg 


cctacttata 


taataattat 


21540 


gataattaag 


gaaccagaga 


gactgagggg 


21600 


caccgaccca 


gtcggattaa 


catccaaagg 


21660 


ttttaagcat 


tttgtgcagc 


gggggacgat 


21720 


agaaacaaag 


acagttattc 


aattaagaca 


21780 


aaaaacatgt 


tttacgactt 


gagtttatct 


21840 


gaaatagggt 


cttcacaatg 


cctgggaaag 


21900 


acaggcagtt 


aattttaaag 


gactccagct 


21960 


tcttacatac 


aaccgagctt 


ttgcttacac 


22020 


attcaacaat 


catttaggga 


gtgcctacag 


22080 


tataatagag 


aatgatgcaa 


aatctctgct 


22140 


ccctagaaac 


acacaagggc 


tctcatagga 


22200 


acgagattct 


aacccagatt 


agaagacatg 


22260 


ctcaactgag 


ttcttaaaga 


taactgagcg 


22320 


gaggatacat 


taaaagcctt 


taagttagaa 


22380 


agactttagt 


gggtctaact 


tcagtgtata 


22440 


agaggtgtac 


atcacaagag 


attataaaat 


22500 


ttatttttta 


ttttttattt 


atttattttt 


22560 


agggggattt 


ggcagggtca 


taggacaata 


22620 


aacaaaggtc 


tctggttttc 


ctaggcagag 


22680 


cctgggtact 


tqaqattaqq 

ZD ZD ZD ZD 


qaqtqqtqat 

ZD ZD ZD ZJ ^- ZD 


22740 


tctgtttaac 


aaagcacatc 


ttgcaccgcc 


22800 


cacatgtttc 


agagagcaca 


gggttggggg 


22860 



22884 
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<210> 76 

<211> 51959 

<212> DNA 

<213> homo sapiens 

<220> 

<221> genomic_DNA 

<222> (1) . . (51959) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (29933) . . (30028) 



<220> 

<221> coding_region 

<222> (34943) . . (35011) 

<220> 

<221> coding_region 

<222> (38775) . . (38831) 

<220> 

<221> coding_region 

<222> (40333) . . (40492) 

<220> 

<221> coding_region 

<222> (42979) . . (43079) 

<220> 
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< 2 2 1 > coding_r egion 
<222> (43668) . . (4373« 

5 

<400> 76 

nncaactccc tcaactggaa 
10 tttgtggaag gagatcacag 
actatcaaca tgacttacca 
cttcttcact gtaaatttac 

15 

agtcactatt cacagcccag 
gatggcctgt gtgttctttt 
20 ctgtgattca aacagcaaat 

ctttagaggt cctcctgagt 
atggtaagcc agtagatgag 

25 

ccagtagcca aataggtcta 
gacagagaag ttcaaaatct 
30 aattatgccc aggagtttta 

tgtgttaatg gcccagctgg 
ggcagtccct tcatctgagc 

35 

tttctccagg agtgagactg 
gagcttaggt ggccttgcct 
40 gcactgcaag ggacagtgtc 
gttctacaag attgccttct 
aaaccaccca cgcattcgca 

45 

gattatattt ccttgtgttt 
tctccatctc ttagtgtgct 
50 gcattccctt tttaaaaaaa 

atttttagta tcatataact 
ttcaatttgc atatgtatat 

55 

taaatagtaa gtatagtgta 
gtataacatc ttatattttc 
60 tttctttttt tctattagac 



gttacctgat tttttctcat 
atataagagc cagtatcatc 
ctgttactct tagtcacttg 
tctttttatt cccctttaca 
acttaagggg tggggagtta 
gccataggct ccactcagtg 
ggagtcaatg gggtccagga 
gtcctgttgg ttggggtcca 
tatattcaag agatctacat 
ccaattgctg gggttttttt 
tgacaatcat ctttgcaatg 
ttagtttcat ttggatatta 
gttgctgcat gtggatcaac 
ccaccaggaa gagacaatca 
tgtgtaactc atggcccatc 
gtgcatgtgc tctgctgtat 
tggagtgaca tggctgcatg 
ttatcatgct tacctctgaa 
gtataattac aacttggtaa 
taaatatgat ttttcagaga 
ggttgggttt agatcaagat 
aatcttctga atattttgaa 
cacttctcag ggcatggtgt 
tttaaattac ttctaatttt 
ttcattttta tttagattac 
ctttctattc accattattt 
tgccaaaatt ttgtttaatt 



gattagactg atgtttgggt 60 

acatcccatc aacagcccac 12 0 

gttgaggtag tgatgtcagg 180 

ttctgtattc tttggaagga 240 

cgtttatgaa tccaaagccc 300 

aactgagagg ttgctgagac 3 60 

gaagagtggg gtgggtggcc 42 0 

actccatggt aacagtcttt 480 

caggtattgt aggagtgata 540 

tgtggttttt ggtggggagg 600 

agaggctttt ctgctttctg 660 

ttaggtcttt ttatcttgtc 720 

agggtgtcta gtccttgttg 780 

atatagcaga agggcagagc 84 0 

actccatggc agatacctgg 900 

tggtggtggc tgttctcagg 960 

gtgcccctct ccctcactaa 102 0 

gtctgcagtg aaatattatt 1080 

tgatatatgt tacactgtgg 1140 

ctgaaaaata tctgaaaact 1200 

cttttaaaat gaatttgtga 1260 

taaaatttcc attgtctatt 1320 

tttatttatt taattattta 1380 

cagggtatta atatttattt 1440 

taatatatca gattttaaat 1500 

aattgttctt atttttccaa 1560 

tccttcttat tatttgatgg 1620 
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aagacccaag aattgcccta tggattaaac 
aagttgtacc atatttaagt tggcttcgca 
5 aagagagaaa aggcagtatt cctggaaaac 
agaaacacac attactattt aggcagagtt 
taatgaagta ttaagtgcat ggggcttttg 

10 

cacatcagtt agctgtgatc attgttggaa 
tagggcagac aaaggagtga cctgtagtat 
15 cactttcctc ttcccagcaa tctgtgactt 

tgactccgtc cctgcctcca ctgtgctctg 
ccccctttcc actgcaatga ttgaatcagc 

20 

gaatgattct caatcgtctt gttttgaaac 
aaactacagg gtgatatatg gatgtaaggc 
25 ggtgggaaat cagcctaagc atgaagccaa 

cacaaaggta cacaagagtt ggggctggag 
agctccaatc ttatctgaag ttcacctcaa 

30 

ttttttgttt tgactttctc tatatgataa 
gcatttagaa acacaaaaaa tgatttgttt 
35 cagattgtaa tttagataag taaacaatca 

ggatgaaaat tgaggagaga aaaacaaaga 
taggttccca cttcaaacag cttatccagt 

40 

acccccttgc ctttgctgca ccccaacttg 
aaccccagtc cacattcctt tccttattag 
45 aatgacccct cgtctctctc ttctcacctt 
taaatgttta gccagtcacg actagttaga 
aacacacaga agcagaagct gcattagaga 

50 

gtgctctcgc cattgctcca tatgcgagac 
ctgagagacc ctttgtcctt tgtttcagcg 
55 tttccaacaa tggggacagg aatggaaaaa 

ttctcttgac taatatatga aaaacactgg 
cccaccagga atttaggctc cttatgggta 

60 

aaaccgatac taaattgaca aatcttgatg 
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aatcacagac 


aatcaccatg 


taggccctca 


1680 


ttttctcagt 


tctcatgctg 


gatqtqqttg 

ZJ ^ ZJ ZJ ZJ **" 3 


1740 


agaactggaa 


gtgqctgaac 

ZJ ^ Zj ZJ ZJ 


tgtcaggcca 


1800 


ccctattaag 


acagagctcc 


cttaatattt 


1860 


gtttgacatt 


cacttgggaa 


aggctgaatg 

ZJ ZJ «J ~J 


1920 


gaatggatca 


gaagaaagtg 


tccttgggca 


1980 


gcacttgttg 


cctqcccaqa 


atccatcttc 


2040 


ctgactgggg 


atccatgtaa 


ttctgaggac 


2100 


gctctaggcc 


cagtcaatgt 


gcttttcaca 


2160 


qtqqacatqt 

ZJ ZJ ZJ ZJ 


aaccttacat 


ggtctcatca 


2220 


tctagactat 


tctaaaaaca 


ctattcttcc 


2280 


ctagacttga 


tgtagacatt 


tttgctaagc 


2340 


cacacagaag 


aqqqtaqagc 

***■ ZD ZJ zj w *** 3 zj 


taatacacta 


2400 


cctccatgac 


atctaaacat 


ctaaaccagt 


2460 


ctctgtagtc 


ttcagaaaca 


ttagcctaca 


2520 


ataatgtact 


cctccctcct 


cccatccttt 


2580 


aatgtcacaa 


agaccactat 


ttctccaatt 


2640 


aaatccttta 


ctctgagccc 


tggaaatgaa 


2700 


aaccagggca 


ttcatggtgt 


tagatacagt 


2760 


ttccctgttc 


tctatcctat 


aattccacat 


2820 


tctgaaccta 


qqcatqcctq 


aacttgctac 


2880 


qqaataqqtt 


accttcctaa 


tgtccccgta 


2940 


ccttacatgt 


ctaccttatc 


taagaaagtt 


3000 


ctgtgctgtc 


caaccctagc 


caatagagga 


3060 


taataaaacc 


cctgctttcc 


tttgttctgt 


3120 


acacccttct 


gcaqaagtaa 

Zs 3 ZJ 


atttgccttg 


3180 


ctagttcttt 


tttgtgacac 


tgagcatgtg 


3240 


tgagttggag 

3 — ' ZJ ZJ ZJ 


agaggcaata 


aaaatctttc 


3300 


caccaccaag 


taaaaaagat 


aagatcccct 


3360 


ctcagacaaa 


cagtttttct 


aagctgtagg 


3420 


aatacctcat 


aggctgtggc 


attcaatgct 


3480 
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actattttgg 


accttacggt 


gcaaaagaaa 


5 


aagtaattac 


actgtctact 


attgctacat 


ataattttcc 


ctgttgttat 


tactctcttt 




ttgctacagc 


aacaattctc 


tgcttcacac 


10 


tatgtgaaca 


gggctgccat 


tttgttctca 




caaatgagct 


accactcaca 


gttattttat 


15 


tgcagaccta 


tgcaagagat 


cagtaatacc 


cagtacatag 


ggagcatcac 


gagagcacaa 




ttgggaagag 


gaggattcat 


tgaaaattaa 


20 




o^aua upturn i> 


tu. Vj> 0-^5 ■ t* ^3 3 




gaaacagttt 


ggcttactgg 


ggggaactgc 


25 


gagaagtgac 


agtttcattt 


gggtgagagg 




cacaggactt 


caaatgctta 


ctggagacat 




acagaaagat 


tcaagcagca 


cagtgacaaa 


30 




tot" aaoaata 






acatgggata 


gaatattaat 


attaggcaat 


35 


caaaggaggg 


catttataat 


gatatgttgt 




attctactaa 


taaaaaaatg 


cctaggagtg 




agaattttta 


acatatgtaa 


gttttgagga 


40 


aaaatactta 

£A CI U U l*> C*. *w l^- w 


aatgaatcaa 


cactaaagaa 




ggcaacaatt 


gaccaaatat 


tcatcccaaa 


45 


ccacattaac 


taatcaataa 


agctagaggt 




gattaatttc 


aagcccaatg 


aatatttata 




gcctaaagtg 


aatatatttt 


ggtcatacta 


50 


ctt"aaataaa 


aacataaaaa 


atttttatga 




tagacagaga 


cttgactgta 


ttgacttatt 


55 


ggacgtaata 


taatatactt 


tcaaataaat 




tgtccatgtg 


gaatgagtat 


gactagccaa 




aaagacagta 


ctatcagaaa 


ttaaaatgga 


60 


gtgttaaaaa 


agactgaatg 


gcaagaaaat 



tacattaata cactgcctaa cattcatatc 3540 

tgtcttattt tgctcttgtt ccaaattcac 3600 

tcatgaatga agaggttttt gaaaacaggt 3660 

aggagggact cctaagtgcc agcctctgta 372 0 

ccagtctaaa gttatcctag gttcctattc 3780 

tcagccaata tttattaagt atctggccca 3840 

tgtagaagat aatgtccctg cccttgaaca 3 90 0 

aaagttcaga tgtgttggct aaaatttccc 3960 

cattcaaata atgctttcaa agaagagcag 402 0 

ggaatcagtg cgcacagagg ctagagccct 4080 

aaacactttg ttatgattct gtcacagtgg 4140 

taagatcctg gacgtgcaaa atgtggaatt 4200 

tgggctttgt catgcagtca atgtgcagac 4260 

atcccatttc tattccaaaa gttcctatta 4320 

aatgaaggtc agaaaatcca aaacaataag 43 8 0 

aaaagttcta aggaaaattc tcaggtatga 444 0 

ttagaatctt atgatcaaat aacatataat 4500 

tgagaaacta acaaaaacac agtactcatg 4 56 0 

atgtaagcca aaaaagatga tttgaattac 4620 

agcatgcttt ttaagaaccc ttggtacatt 4680 

acctcactaa cttaaaacaa aaaatccagc 4740 

aagtaataca aatatttgtt accaaaataa 4800 

gacttttaat tgttaataac cagattaaat 4860 

aaacaaaaat ataaagtgtt cagtaattca 4 92 0 

ggaaagacat acgcctttta tttagaaaca 4 98 0 

tctggttgtc agttattcca catttttttc 5040 

ctcttgtgga aattgactta atgttcttta 5100 

aagtatttta ataaagacta atgttgaagt 5160 

gtatacaata actaatacaa gtgatgcgtt 5220 

gagaatagaa tgcccacata gtaatcatag 52 8 0 
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aaaaaatata aatggatgaa gtgcaggacc 
catgcactta tgcatgaaaa aatggatttg 
5 gaaaagtgat gtgttttgaa atgcaaaaag 

tgagagcacc acaagaggaa cagttaagat 
acaggacatg caacattgca cttttcacag 

10 

cccaggtact ttagttcagt aagactatgt 
gtggtgaaac tcaagccctg ccgtcttctt 
15 tgcccatgag gcaatttcat ggcatggcat 

gtcaagtgag gaagactgaa gccgaagact 
ttgtccttac ttgtatttta aaaacccagg 

20 

atatctgaca tttggaacct tgtattatta 
gcacaggctc gggactgatc ctgtaacaaa 
25 tctcagccaa atggctcagg gttgacccta 

gctgcctacc atgccctgaa taaccagtta 
cagggggtgt taatgtcatc ctttgtggag 

30 

ttttcatcaa gaatttagtt gctgatccta 
atttataaaa cgagagtgtc cagattgaag 
35 tggaatacag atcaaattta atcaggtaaa 

aatcacctgc acaaacctga gatgggacta 
aggcctgggc ccttctacta gttagctgaa 

40 

agaagtctgc ctttgcctct agagcaatgg 
ttgggaaact ttaacacgta ctaacacatg 
45 aatatggatt atggttggag agtcaaaatt 

tagtcaatgt tgccaattac tgctataagc 
aatctggagg gcttcttaaa acacagaatg 

50 

ggggtagggt atttgcatgt caaccacaat 
aaccacattt taagagctac tgctctagac 
55 gctttttaaa ttctaccgct ggtattgagc 

tcttctcttc tgagtaggat ttattccata 
ctgtgtctgg agggtagggc caagttggac 

60 

taaggacaga gagtaggatg tgatgtcttg 
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agacaattta actaaagaaa tagaaatggc 5340 

aaagcaaata atttgtataa tgaaataaac 5400 

aaaaaaaaag tggttggtgg agaaaagctc 5460 

atgaaagtcc ctggaaaaga ggaagctggg 5520 

gatcagcacc aagagcttgg agttcaagtt 5580 

gatagctgtt atgcggtcat ctgggggctg 564 0 

attgtgttta gtcggaatgt acaacaatta 5 70 0 

tggtatattt gcttcagggc tagagtgaca 576 0 

gaacatcaga gttatacatc ttgacatcct 5820 

cccagataac tccaacactc ggtttctcct 588 0 

gttcagaaga agtactgaag aggcctcagt 5940 

ggtgatcaga tgacacccag gcccagctgg 6000 

aatatggaaa accagagaat caggactatg 606 0 

tatggattgt catctggtag aaatcctatg 6120 

tctcttggtt ggatcttcaa ccacttaata 6180 

tttgcaaatg atccaaaaac aagaagtata 624 0 

atacaaccca aaagtctgga acatgagcgt 6300 

attaaagtcc caagcttatg tttaaaaaca 6360 

gaacttgctt aacagaagtt aatgtgagaa 6420 

aggatggaga cagaagcaaa agactctcca 6480 

atctcaatct tgtctgagca ttgggataac 6540 

agccccatta ccaggtattc tgatttaatt 6600 

ttttgtagct ctttagcgga ttctaccgta 6660 

ctgtagatat caaacttgag tgtgtatcaa 6720 

cacaccgaga ctgtctgatt caataggtct 6780 

gtcaggtgat actgatgcta ctactctggg 6840 

gaacctggag ttctccagat gcaactaaag 6900 

cagaatgcat agaacctggc cctgtgccta 6960 

agctttggcc ttctctttac atcttcatat 7020 

ttcgggatac taaactgagg ccttttattg 708 0 

atgctgcttc aggttactag aggagacaat 7140 
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gatctcagtt 


cagaaccaag 


cttaaagtta 


5 


atgcaagcta 


ggatttactt 


ttcaatcttt 


tttgggaaga 


aattagggta 


aatgagcaat 




tctcctagca 


ctcatggtat 


ggcctgttca 


10 


tgcacaattt 


tcaqqaqtcc 


agaaatacaa 




caattgttgt 


catctctttc 


cacacacaca 


15 


caggcagttc 


ttacaatatt 


gagatgcttt 




atttacatct 


tataaaaata 


gaacatcaga 




tgaaatatga 


gaagtctgtg 


gcccagggaa 


20 


atgaatgtat 


aacccagttt 


ttctgttttt 




tgtcatattt 


aattgagcca 


caagagggag 


25 


tgaattccta 


cctaatctgg 


tgtcttctgg 




ctggtcttgt 


atactagcgc 


tgggttgggt 




tagtagatac 


agcaagtcat 


ccctctgacc 


30 


gtattattta 


gatattatta 


cttatattga 




ttttcaaaac 


atcacaagtt 


gattttcaag 


35 


tgctttttgt 


accatcctaa 


ctttatacca 




cactctggtg 


aactgaaaaa 


catattttag 




cctctccctt 


ctcctgtctt 


tccccacatt 


40 


aaqqqtQtqq 


atacagctag 


aqqqaqaqqt 




tctgaaaaat 


ctgggttata 


aatggcacac 


45 


ttttggaaca 


tggatcttat 


tgcagaaggt 




ccaggtggca 


acatctgtgc 


ctggcttctt 




cataagggaa 


ataacctgaa 


ccagcccgac 


50 


atagatcaca 


cattaaggaa 


atatgtccat 




atggatctag 


aagtgacttt 


gaaagtgaat 


55 


actgatggaa 


atgaaccaag 


catttatttt 




gggtggagag 


aggattaaaa 


gaagtgaaat 




acccattccc 


atcatggttg 


ggccagtacc 


60 


ccctttattt 


tacacagtat 


agaaattcac 
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tatatctctc 


ggggactatt 


tgtcccagga 


7200 


ccccagtttc 


tttcccaaaa 


gtaacatatc 


7260 


ggtaagctca 


aatcctggca 


tcaggaaaat 


732 0 


ccactgacca 


caaagacatg 


tagaaggtgg 


7380 


acagttctct 


taatacttgc 


tctgtagtct 


744 0 


tgcatgcaca 


cacacgcaca 


cacatgcaca 


7500 


gcttagactg 


ggctgtatat 


aagagacctg 


7560 


gttagaagag 


acgtggaaat 


ctttctacac 


762 0 


agaaagtgat 


tcacgtgagt 


catgaaggta 


7680 


cctattatcc 


ccttcatttt 


gacattgagg 


774 0 


gcagtggaca 


agaaattgtc 


ctacctaatt 


7800 


gaagacctta 


tgaaaagtca 


actcccagct 


7860 


cccatttcta 


tcccatattc 


tctgggaatc 


792 0 


cttagctttt 


tcatctgcta 


aattttgtag 


7980 


aaggtgttag 


tgaggatacc 


tgagaatgaa 


8040 


atagtattaa 


agtgctttaa 


tttcaaaggg 


8100 


cttttttgtc 


cactcccacc 


ccattctctg 


8160 


attttatcta 


attgttattt 


actctactct 


822 0 


ctgtaaaatg 


tgagcacatg 


catgatggag 


8280 


tgagactgca 


catctcacct 


ggtctcatat 


8340 


cagagacatt 


accatgccct 


cctctctgct 


8400 


gagggtttgg 


gaaggcttcg 


ggatgagtgg 


8460 


cccttcttgg 


tggaacagcc 


actggcctct 


852 0 


cctgagagcc 


tccctcatgg 


tgttggactt 


8580 


gagagatttt 


aggatgactc 


atatggtttc 


8640 


gaaccaacct 


agtagtttct 


agtgaagctg 


8700 


ccaatgccat 


agagtggatc 


ggtgaaagct 


8760 


aaagttagga 


cttctgaacg 


tttctctaat 


8820 


atttttggag 


tttttgtttt 


gtttctcttt 


8880 


tgactactcc 


tcagattttt 


ccagtcctgc 


8940 
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ctccccagag ggactcacat cccttttcat 
ctcctaggat tcccactcgt tactcaaaat 
5 ttaggtgaat aaaatttaat ctaataatat 

tgagataatg agtggaacca cgcatttatt 
agttatctgg accaacaagt ggtaagagca 

10 

ctgaggaaac agtacgtctg gagacacatt 
gttgtctgag atttctgggg ggctggatta 
15 ttttgagtga ggtaggaggc attggaagat 

gcatcttaac aggaccactc tggctgcctg 
gggaagatgt ggggagagta gcgaggaggc 

20 

ggttgtgttg ttggaacaca gcatcactgc 
gcctctcttt ctttcactcc ccacattcag 
25 tgtccccaag aaacataatt catcccatgt 

cgcagaattt gggcagagtt aagggaacga 
aaagcaggaa ggtatgagaa gagaacacac 

30 

ttgtaagaag agctgctttg tagtataact 
gtcaatgcag agagaatagg aaagcaatgc 
35 tctccaaaca aacctgaatg gtttgcttta 

attttcccag agtgacacaa ttggcaaatg 
tttttagccc tgtggtcttt ctattacact 

40 

aatgagactt ggctttggct cacagagaca 
gfcgagaagat atattgtcgg ctcattccct 
45 gcattttttt tttttggaat gaaatatatt 

cctgatgttt tctgtagctt tcttttttta 
atagttttta aataaataat ctgtacacca 

50 

gtgtaggtac caacctccca ggattttagt 
cccaccatcc tctctcttgg actactgtcg 
55 cctcatctcc cttcagtctg ttcttatcat 

actagtcaga tcctgtgttc cttttgctga 
ataaacctaa gtccttaata agacctcaag 

60 

tcagacaaca ttgccttctc ttctccctat 
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ggcagtaacc 


acctggtgtt 


gctcgagcac 


9000 


ggataataaa 


gaaattttct 


caaacgcaat 


9060 


gtgtgaacta 


ttcttctgaa 


ataataatgc 


9120 


ttcttgaacc 


tgataataca 


aatccctagc 


9180 


gaggtcttga 


ggtgagcacg 


ttccatgctt 


9240 


gagactggaa 


tagggagagg 


agaaggcaat 


9300 


taaagcgact 


tattaaggag 


atacggctta 


9360 


tttcagcaaa 


gggaagacag 


gatctaactt 


9420 


tcttggtaag 


aaccgactga 


agggaggcga 


9480 


tccggcagta 


gtcccagaga 


gaggatggtg 


9540 


tcattcaaat 


cctaggaggt 


gggtgcctac 


9600 


tcattttgaa 


ggctcaggta 


tcagtaaaag 


9660 


gacttattta 


agggattttt 


atgaagacct 


9720 


agaaggaaag 


ggaggtcact 


acagacaaga 


9780 


tattatgggg 


accaataaag 


agcagaagtc 


9840 


gtggaagaag 


tagctacttc 


caaaggctgt 


9900 


tttgtccttt 


ctctctttcc 


agcctctaat 


9960 


ctgtcacaca 


tttatagggg 


atttgtggag 


10020 


gctgaattgt 


ccggagctca 


gacctgctgc 


10080 


acctggtccc 


cctccagaga 


gaaacaacta 


10140 


atgaataggg 


aaagttggaa 


gtctaagtgg 


10200 


tttttgccct 


cccaagagaa 


aaaagagctg 


10260 


ttctttttct 


gatagtgttc 


cactgctctt 


10320 


aaacttttat 


tttatgttca 


gggtacatgt 


10380 


aacccttgtg 


acatgcagtt 


tgaaagacaa 


10440 


gagaagcaac 


gctgcatccc 


aacaacccaa 


10500 


gagcctcctc 


actatcctcc 


ctacgtcttc 


10560 


ggcaagcagc 


cagagtgata 


ctgttaagac 


10620 


aaatcttcca 


gtgcctctat 


ttcactcaga 


10680 


ttcttttata 


attgaggccc 


cgcagacacc 


10740 


cccaatctca 


atgtgtctcc 


agacacactg 


10800 
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tcccctcaga 


agcacaggaa 


cctgctcccc 


5 


ttcaaataac 


tgctagtgag 


ctctaaggat 


ttacatgcca 


cagaggtttg 


gtgtaccgat 




ctcaatagat 


agttttttaa 


tccttaccct 


10 


agtgtctgtt 


tttcccttct 


ttgtgtccat 




gtgagaatgt 


gtgatattta 


gttttctgtt 


15 


ccagctccat 


ccgtgtttct 


tcaaagaaca 




ttcccatggt 


gtatacgtac 


cgttttttcc 




ttcatgtctc 


tgctatgaac 


atatgtggac 


20 


cctttgggta 


tatacctaat 


aatgggatag 




ctttgagaaa 


ttgccacact 


actttgcaca 


25 


gtgtatggtg 


ttatttttcc 


tccacaacct 




ataatagcca 


ctctgactgg 


catgagatgg 




ttctctaatg 


attagtgatg 


ttgagagttt 


30 


cttcttttaa 


aaagtgtctg 


ttcatgtcct 




ttcacttgtt 


aatttaattt 


ccttatagat 


35 


gtttgcacat 


actttctccc 


attctgtagg 




tgctgtggag 


aagctcttta 


attaggtccc 




ttgcttttga 


catctttgac 


atgaaatctt 


40 


ctaggttatt 


tttcatggtt 


tttataggct 




attagttgat 


ttttgtatat 


agtgtaagtt 


45 


ctagccagtt 


atcccattac 


catttattaa 


gctgactttg 


ttaagtatca 


gatggttgta 




tctgttccat 


tggtcaatgt 


gtctgttttt 


50 


agctttgtaa 


tatagtttga 


agtcaggtac 




taggatttcc 


atggctattc 


aggctctttt 


55 


tttctaattc 


tgtgaagaat 


gtcattggta 




ttgctgttag 


cacaatgact 


attttaataa 




tgctttttta 


tttgtctgtg 


tcatctctga 


60 


tgtagagatc 


tttcacctct 


ctcattaact 



tcaagagctt tgcactttcc acttctttgg 10 860 

ttgcattatg aggttcaaga aaataaatgc 1092 0 

tattttatca cccatgtaat aagcatagta 10980 

cttctcaccc tctaccctcc agtaggaccc 11040 

gtgtactcag tgtttagcta ttactcataa 1110 0 

tctgcatgag tgtgcttagg ataaaggaat 11160 

tgaactcatt ctttcttatg gctgtgtagt 1122 0 

agtctaccat tgatgggcat tcagtatgat 11280 

atgtatcttt atggtaaaat aatttatgtt 11340 

ctgggtcgaa tggaacttct gttttaagtt 11400 

aggctgaact aatttacatt cccacaagca 11460 

ttttagcatc tgttactttt tgacatttta 11520 

ccacagtctc cttgtagttt tgatttgtga 11580 

tttaagtata cttgttggct gtgtgtatgt 11640 

ttgcccacct tttaatgggg ttgattgttt 11700 

tcttgatatt agacctttgt cagatgcata 11760 

ctgtctgttt actctgttga tagtttcttt 11820 

acttgtccat tttttttttt ttttgtgcaa 11880 

ttccagggcc tatgtccaga atggtatttc 1194 0 

tagattttac atttaaatct ttaatccatc 12000 

agggatccag cttcaatctt ctgtagatgg 12060 

atagggatcc tttccacatt gcttgttttt 12120 

ggcatgtcac tttctttctg ggatctttat 12180 

gtaccattac catgctgttt tggttacagt 1224 0 

catgatgcct ccagctttgt tctttttgct 12300 

cagttccata tgaattataa aatagtttat 12360 

gtttgatagg aacagcattg aatctgtaga 12420 

tattgattct tcctatccat gagcatggaa 12480 

ttactttgaa cactgttttc taatcctcat 12 54 0 

aaattcctag atattttatt cttttttttt 12600 
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ttttggccta ttgtgaatga tttggctctc 
tactgattta tatacattaa ttttatatcc 
5 taagaggttt tgggcagact atggggtttt 

gagatagttt gacttcttct tttcctattt 
ttgctttgtt tagaacttcc agtactatga 

10 

ttgtttcagt catccaggag aactttccag 
tggtttgcca tagatggctc ttattatttt 
15 agtttgttga gggtttctta acatgaagac 

atctattgag atgatcatgt ttttagttct 
accttacgtt aaaccaacct tacatccaag 

20 

gattttttat gtgctgctgg atttggtttg 
ctgttcatca aggatagtgg cctaaagttt 
25 gtgtgtgtgt ctgccagatt ttggtatcag 

gaagatcctt tcctcctgaa tttttggaat 
tatacatttg gtagaattgg gctatgaatg 

30 

ggctttttgt tgtgatttaa tttcagaact 
tcactggttt actcttggga ggttgtatgt 
35 ttctagcttg tgttcataga ggtgttcata 

gggattagtg ataatgttgc cattttcatt 
ttttctttat tagtctagct actggtctat 

40 

tcctggattt tttaatcttt ttatggtttt 
ggtactggtt acttcttgtc ttctgctagc 
45 gtcctctgtg tgtgatgtta ggttattcat 

atttagcact atacattttc ctcttaatgc 
tgttatatct ttgctctcat tagtttcaaa 

50 

gtttacccaa cagtcattca ggaacagatt 
aaagatattc ttggtattta tttcagtttt 
55 tatgattttt tttaactgct gaggattgtg 

tatgtaccat gtgcacatga gaagaatgtt 
tagtagtctg ctaaacccat ttggtgaaat 

60 

gttttctgcc tctatgatct gtctaatact 
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agcttggatg ctgttggtgt ataaaaatgc 12 66 0 

tgaaactttg ctgaagttgt ctatcagatg 1272 0 

ccaggtatag aattatatca tctgcaatca 12780 

ggatgcattt tagttctttc tcttgcctga 12840 

aaaggagtgg tgagagtggg catccttgtc 12 90 0 

tttttgccca ttcaatatgg tgttggctgt 12960 

attattttga ggtatattcc ttcaatgcct 13020 

atgtagaatt ttatcaaaag ccctttccgc 13080 

gtttacgtgg tgaattacat ttattgactc 13140 

agataaagcc tacttgatta tggtggatta 13200 

ctagtatttt ttttgagtgt ttttgaacct 13260 

tcgtgtgtgt gtgtgtgtgt gtgtgtgtgt 1332 0 

aatgatgttg gcctcataga atgagttaga 13380 

agtttcagta ggacggtgtc aactcttctt 13440 

catctggtcc tgagcttttt ttagttggta 13 50 0 

tgttattggt ttgttcaagg tttcagtttc 13560 

ttccaggaat ttatctattt cttctatgtt 13620 

gttgtctctg aacatttttt gtatttctgt 13680 

tctgactgtg tttatgtggg ttttctcttt 13740 

caatcttatt tattctttcg aggaatctcc 13800 

tcacatttca atattgttta gtttagctct 13860 

tttggggttt gttagctctt gtttctgtag 13920 

ttgagatctt tctaactttt tgatttgggc 13980 

tgctctagct atgtcccaga gattctcgta 14040 

gaatttcttg atttctgact gaatttcatt 14100 

gtttaatttc catttaacta tgtagttttg 14160 

tattgcactg tggtctgaca gtgtgtttgg 14220 

ttatgcccgg ttgcatggtt gattttagag 14280 

tactctgctg tttgggggtg gagagttctg 14340 

gttgagttca gatcctgaac atctttatta 14400 

accaatgggg tgtcaaaatc tcctgctatt 14460 
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atcatgtggt tatctaaatc tcttcatatg 
gctctggggt ttgatgcata tgtatttatg 

5 

ttaccattat ataatgccct tctttatctt 
gaaattagaa tagcaactgc ttcatttttc 
10 atccctttac tttaagttga tgggtgtcat 

atatagttgg gtcttgcctc tttatccaat 
ttatcccatt catattcaag gttaatattg 

15 

tgtcagttgg ttattatgca gacttgaatt 
aacagagctt cccagcttac caggagcaga 
20 atgaactgct ggatgttcaa tgtggactaa 
cctggggaga ggtcaaacac ttgcatggct 
gtgtaaagaa cagagaaaaa tcttgtgctt 

25 

taaatactcc agggcattct gttgtcttta 
ttttttcaga ttctgattga ttctgattga 
30 ggaaaaggtc aatacacaac tccaccttcc 
tcagctacat ctctctctcg tatcctgcct 
tttgtgaagt tgataactga gaagcacaag 

35 

ctagaggatg tttcccctca ctctacacta 
acaggagatt atagtgaatg aactgtaaac 
40 ggagatccaa taataatagg ggagaccaaa 

gacccaaata gctacaacaa acaataaatg 
accttacata aaaggcctat ttactttact 

45 

ttacaatgta tgttaaaagg taaaagacaa 
aagattcaaa tgcggcagag attttggaat 
50 ttaatattat aaggacttta acaggaaaag 

taagcagaga gaagaaattt ctaggaaaga 
cagtaataaa aattttaaaa atgtttttga 

55 

ggaaagagtc agtgagcttg aagatacgtc 
ggaaaaaaaa tggaaaaaaa aagaacagaa 
60 gaataacatt acacataatg gaaataccat 
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tatctaagaa 


tttcttttgt 


gaatctgggt 


14520 


acagttaggt 


cttcttgttg 


aattgaacct 


14580 


ttaaaatcat 


tgttaaaatc 


tgtttggttt 


14640 


tgttttcttt 


tgcttggtag 


atttttctcc 


14700 


ttcttgtgaa 


atggatcttc 


tgtggacaac 


14760 


ttgccacttt 


gtgcctttta 


attggtgcat 


14820 


atatgtgcag 


gtttgatctt 


gacatcatgt 


14880 


tttgtcccga 


aatttggaca 


ggaaggccca 


14940 


agcacaatag 


gaacacatca 


attataaccg 


15000 


cttgggagct 


gaaactccat 


gaggcccagt 


15060 


tttacctcaa 


agtgtcctag 


caggtattta 


15120 


ctgattaagg 


aaggtaatat 


aaccattttt 


15180 


acaaaaaaag 


tcttccctca 


agggaaacta 


15240 


tttggataat 


accaaaacct 


aaccagtctt 


15300 


tctatccttc 


cacgtgggga 


aactgaatat 


15360 


gacttaagtt 


caggtagggg 


ctaagaagca 


15420 


ttgctaaaag 


gctgagacct 


aattatgata 


15480 


ccatcagatg 


aatcaggatc 


ctgtatgata 


15540 


ctcagaccct 


atttaagaag 


tggtgtctag 


15600 


acaaggacat 


tacaggacat 


ttcagccttt 


15660 


cagcctaact 


tctagccaga 


taaacataat 


15720 


tccttttatg 


tccagctatc 


aaaaaaaaaa 


15780 


agtgtgaaga 


aacaaagcaa 


gcaacagaac 


15840 


tatcggatca 


gaaatgtaaa 


ataactatga 


15900 


catacaacat 


acatgaataa 


acagatatga 


15960 


atgaaaaggt 


aatgctagaa 


atgtaaaata 


16020 


taatttcatc 


agtatactag 


gcatgtcaga 


16080 


aataaaaact 


tccgaaactg 


aagagcaaag 


16140 


caaccccaaa 


gtgcaggaca 


actacaaaaa 


16200 


aaaaagaaac 


aagaaagaaa 


ggaagagaag 


16260 
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aaatatttga agtttaaata atgactgaga 
cacaaatcca ggaagctcag aaaataccat 
5 ggtatataat attcaaaaac agaaaatcaa 

aggagaaaag gaaacttaac tagagaggaa 
cagaaactat gcaaacaaga agggggtgaa 

10 

gttcaggcac cagtcacact tccttgttac 
cagccctgga gttctgccta tgtgtgttgc 
15 ccattttaag agtgccagca tgttccatag 

tgaattaata tcctcatatt ggaagaaaaa 
acatggtagg cagtatttat ggcaagggtg 

20 

tgagataatc agatgggaag aacgaaccag 
gtccttagaa cgtacctatt ctaactctga 
25 cattgagttt agtttctgag ttgagattaa 

ccttactgtc ttcagccact ttctttctca 
atgtgaggag acttcatgta ttctctgtgt 

30 

cgtgctccct cttagctctt aatcttatgg 
aagagccata atcttgttga atcaaagtcc 
35 tttaattcat ccactgtttc attaaagcaa 

tttgcttgga tttctgaata tagtacaggt 
taattcaaag gagagactgt aaaaaatata 

40 

actggcattt ttttagcttt ttgtggaatt 
aatccggtgt attagaaaca tgactttagg 
45 tacttgccag agatagttgg tgattattat 

gagattagac atctctataa ttgtgggcct 
aaagcttttg tttcctttga taacttatat 

50 

gtaagtcatt ttctaagagt ttcagattaa 
tgcagagcag taaattggtc agagctaaga 
55 tgatttgtat acactacttt ggtgtcagca 

accctgtcct taggggacac tggcatttat 
tctgaggtta agtgttaaga attttaagag 

60 

tacatatgta ctatttgaag aaccatttct 



PCT/US01/48437 



tttttccaaa 


attagtaaca 


gatgccaaat 


16320 


gcatagtaaa 


gagtaaaaca 


tctatacata 


16380 


agacagagaa 


aattttgaaa 


gaagcaataa 


16440 


caaggataag 


tattacatta 


gtcttctttt 


16500 


atgaagtatt 


taaaatgttg 


gaggttaaag 


16560 


agtagacact 


ggttgagatg 


ttaggtcttt 


16620 


tatgttgtac 


acaggtcaac 


tcaaactctt 


16680 


agctcagtat 


atctcaaggc 


ctgggatttg 


16740 


tactcatgaa 


atgtggtggt 


gatgagatac 


16800 


gtgcttaaaa 


aagctggttc 


atattggcaa 


16860 


gtgtagaatc 


actggagatc 


aggccggctg 


16920 


ggatgaagaa 


acaaatatta 


ctctacatcg 


16980 


aacttggatc 


atctcactcc 


cagcgatgta 


17040 


gacattcttc 


acagtaaata 


tttcttgcac 


17100 


cctggttttc 


ccttagtatc 


attgatcatc 


17160 


ttttaatact 


tttttcctag 


aaaccataac 


17220 


gttggcctat 


caaaatgttt 


ataacatagt 


17280 


attataatga 


gtgactgctg 


gagaatagta 


17340 


agcctggaat 


agctaaggaa 


tctgtgttca 


17400 


tcaaaacact 


aaagctggtt 


ttgaataact 


17460 


tattttaact 


gcctcaaatt 


cacaagataa 


17520 


acacttgatg 


aaatgaaaat 


gttgggaaat 


17580 


cctctgtgaa 


actcaatgtt 


caagggaagt 


17640 


ttagaaggca 


tgtgatgaaa 


taaaaaaaag 


17700 


atttgttcaa 


tttgactgac 


ttgattggct 


17760 


aacttaattt 


cattagttgg 


caaaaatggt 


17820 


gatcttgtaa 


tgataaagaa 


gagaagaatg 


17880 


tagacgagtt 


gggctattgt 


cattaaacag 


17940 


tttgcatatt 


aatatacata 


ggcaccagta 


18000 


tgttttctta 


agaataatac 


ctttctttgc 


18060 


taagcacaaa 


ggaaacccat 


atgggaaaaa 


18120 
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aaaacatttt gtgcttttaa tttagaatta 
aggatacaaa ataatatttt ttctagattc 

5 

cacctcaact ttacatgact cctacttcct 
tctaatcttt tattccagat atctctaagg 
10 atataatttt ctctgatact gtatctcagt 

attctctgca ttgtttctat tttacccacc 
ggtgtcagac attgagaaat gttgaaatca 

15 

cagagataca atgaactggt aagaaaaaca 
taatcttggt ggattaaaca catctatttc 
20 ggtctgggag aaagtcataa atccaaaaat 
ttttttaaat gttagaaacc agctagacaa 
tgaaaacctg actgacggaa ttggaagctt 

25 

cagtaagtct caagaattat aagtgctttc 
gtctgagaag gtagttgtat tagtatgttc 
30 tgggtaattt acaaagaaaa agaggtttaa 
cctcagaatc atggtggaag gcaaaatgca 
gagagccaag caaaagggaa aaccacttat 

35 

ttaccacgag aaaagtatgg gggaacacgc 
cctcccacaa catgtgggaa tgatggtagc 
40 ctgccaaacc atatcaatag tgaatactta 

aagatccctg aaggcctaga gtataaccag 
caggaaggga gagtgtatac agaaaaatac 

45 

atagacactg agatttacat ttatggtgga 
agaaaacagc atatgaaaca tgaggcagtt 
50 gaagcgtaat cataacactc ataatgaatt 

agttaactaa aaaatcgtga taaaactata 
gtgggaaggt gtgtccaggg taaaaatgaa 

55 

agtaggaggt tagtgcataa tatataaaac 
acttagaaat atggagtcaa cacaaaaaga 
60 agaaagaaaa ttgattggga actggggaag 
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tataggcact 


ggaaaaatat 


ttttattaaa 


18180 


aaaatgagga 


attacagtgc 


ttagatctat 


18240 


aatatatttg 


ttttcacatt 


ttccctctgt 


18300 


cttttaccaa 


tctctttagc 


tgagattgta 


18360 


tcactgctga 


ctgatgtgcc 


acctacatcc 


18420 


caatagcatt 


caacagaaat 


gttcctacct 


18480 


aaaagataaa 


taagatacat 


tatctgtcct 


18540 


cttataaaaa 


tataattttc 


aaataatggc 


18600 


ctcatgaaac 


ccaactaaat 


gactataaag 


18660 


tcaaaaataa 


tatgagagaa 


attggatcat 


18720 


gcaataaatg 


acttagcagc 


ctccggaaag 


18780 


agaaaaaatt 


tgattctctt 


tgtagaaccc 


18840 


agttgaatat 


gtctggggac 


acagctaatt 


18900 


tcacattgct 


aaaaaagaca 


taaccaagac 


18960 


tggactcaca 


gttccacatg 


gttggggagg 


19020 


catcttacat 


ggcggcagac 


aagagagatt 


19080 


aaaatcatca 


gatctggtga 


gacttattca 


19140 


ccccatgatt 


caattatctc 


ccagtggttc 


19200 


tacaattcaa 


gatgaaattt 


ggtggggaca 


19260 


agtaagtgaa 


ttccctgaga 


atagtgaatt 


19320 


tcctaactgg 


aataggagaa 


tataaggcgc 


19380 


taatgaaaca 


gatcaattac 


ctgtgggtct 


19440 


gtatttaaaa 


caacttattt 


tcaggtatgt 


19500 


actaacttca 


gagagaaaaa 


tggtcaaaag 


19560 


agccattaat 


gatttaaata 


ctataaattg 


19620 


tgggagaatg 


agggaaaagg 


aagtgttatg 


19680 


ggatataggg 


gttagtctat 


tgtcttcagt 


19740 


caaaaattag 


aacagtattt 


aaaatcattc 


19800 


aacaattaaa 


atttaaatac 


ttgcttctgg 


19860 


ggggcactaa 


tgatttcttt 


caggctaaat 


19920 
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ccagtgaaca 


ttttactgtt 


gttagcttgt 




atttcttcct 


actgaaatat 


ttctgtgctc 


5 


ctaatataac 


cctttcaaca 


acgctatgac 




aactgaggtg 


cagaaagcct 


aagcaacttg 


A f\ 

10 


atccagctca 


aggccgggtg 


cagtggctca 




aggagggtat 


attacttgag 


atctggagtt 




cagtctctgt 


taatatataa 


tataacatat 


15 


atatattcta 


ttataatata 


atatatattc 




tattatatat 


aatatatctg 


tatatgatat 


20 


tatatataat 


aggatatata 


ttctatatat 




catgttattg 


ttctcttgaa 


attgttatct 




ataccatatg 


gagtgggagg 


agagactgat 


25 


cttaggaagt 


catgaaactc 


tgtacccttg 




cctgctatct 


tccttaccct 


ctttttcata 


30 


ctgtctctat 


cctctagaaa 


aagggtttga 




gacctcatcc 


ctctgggcga 


agagctgttg 




ctagctcctg 


ccagttatcc 


acttcttgtc 


35 


gatcaaatac 


gttcaaaatt 


ttactatttg 




ttgccttact 


gaatggctct 


atcactcatt 


40 


cctttcctgt 


tttttctcct 


gatttacaca 




tgttcctaag 


catctttcaa 


atccattcat 




ttcagtaacc 


atcatccctt 


ccctagagaa 


45 


tttattatta 


tttattcctg 


ttgtctgaaa 




tttaattttg 


agaatcgtaa 


tttcagatta 


50 


actggatttc 


aatctcatct 


ataccagcta 




tctgaaaaat 


agaaataata 


atattacctt 




tgctcattca 


tcaggtctcc 


aattagatgt 


55 


ccaatgttgg 


gttcattgat 


cctcctatga 




agatcactga 


ttacaatggg 


ttgtaatgat 


60 


ttgcagataa 


ctgatagtaa 


tggattgtag 




agtctataaa 


ctctttgagt 


gaaaattatt 
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ctctctgaag 


tatttaataa 


taattgaatg 


19980 


aaaaaatgat 


aagtaattta 


tttatattac 


20040 


atattactat 


ctatatttta 


cagatgagta 


20100 


ctccagctca 


catagggcat 


aggattcaaa 


20160 


cgcctgtaat 


cccagcactt 


tgggaggctg 


20220 


cgagaacagc 


ctggccaaaa 


tggtgaaacc 


20280 


tatatagaga 


tacattctat 


tatatatata 


20340 


tattatataa 


tatatattat 


atattatatc 


20400 


ataattatat 


ataatatata 


gatattctat 


20460 


atagaatata 


tatatttctc 


tgtcaaattt 


20520 


gctaataata 


acctgatact 


gtctcatttc 


20580 


gcctgtcctt 


agtccagaat 


ttcaacctct 


20640 


tcttctgcaa 


tgccacattt 


tccgagtttg 


20700 


atcctctggg 


aaatcttatt 


tctgattata 


20760 


cgttttctgc 


ctccacccac 


aagcctgggt 


20820 


ggtttacctc 


tatagccatg 


ttctcattcc 


20880 


aactttattt 


gaaggtccct 


acacaactct 


20940 


ttcctgcagt 


tctgtcctgg 


tccatgtcta 


21000 


cagtttccta 


acccagaatc 


tgagtgtcat 


21060 


tccattatac 


caacaagtac 


tgtcatttct 


21120 


tctctccaac 


ttcactagca 


tcacaggtag 


21180 


ctgcctctac 


cttgagccta 


attctctgat 


21240 


ccaggacaat 


atttctgaaa 


tataatctat 


21300 


tagaccacat 


agtttttgga 


attaagtaaa 


21360 


tacgaatatc 


tctgagtttc 


agtttcttct 


21420 


ccttcattcc 


ctttccattt 


gtgtgtcttt 


21480 


ccccttatct 


aggaagtctc 


ctctgcctac 


21540 


gctctcttag 


gatatcctac 


tttaccttgc 


21600 


gtatttactc 


atctgtattg 


ccactttacc 


21660 


tgatttattt 


actggtctgt 


ttttccaagt 


21720 


ttttaccatt 


ttttcagaag 


cacacggcac 


21780 
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actgcttggc ttatagtagt ttttctaaag 
actttgcttt cctcctccat gtaaagtagg 

5 

ttcaggagat taacagggcc atgcaagaaa 
accagctcgt tctatgatac cagctgcttc 
10 atacaaccaa cttgttgcca ttttatggcc 

catgtaaata aaacatttaa gttgtgtata 
ttggccataa aataatggca aaaactgcaa 

15 

atagtagtgc tccaggaaga caagagtact 
atttacaaaa taccatttac ttgtttatat 
20 taaatcctct accactgata tccacagtaa 
tatcaggcag cccctgcttt ttcagcaaat 
aacacaaaat cagttaatgc ttccgatata 

25 

tctcagcaca gcctttaagg ttccaaacat 
agtgaggtgt ggagtggggg aagttgattg 
30 cccagagccc tgcatgcatc agacctacag 

ccagggactt gccttcctag gctttcgggc 
atttctggga ggggaatgag aagaataatg 

35 

cagggttggg gtgggttgag tgcaccttta 
gtttttgcta cagtcgatag gagttagaaa 
40 ataaggttgt ttagttactt ttgtcccctg 

acattttaaa cctttgacta gttacctcat 
tattctctgt gtctgactgg ttatttttat 

45 

tgcagaattt ggtgtacatg gagtagggca 
atgggctgtg gcctgaccta gttctccagt 
50 gcagtaaact gaggctgctc atgataaaat 

agcattctta ggataagaaa gaagcttgta 
gatgaagtaa aagaagccaa taaaagggaa 

55 

aaagggctgg gagaaaaaca aaaacaaaaa 
gcctttcccc cttttccctt cccatgccac 
60 ccatccagat tcaggcctca ttttgggttt 
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aaaatgctga 


atgaatgaaa 


actgcacacc 


21840 


cattatttta 


ttttacaaat 


gaatcttagg 


21900 


tgtttgccat 


tggtttgatc 


ttagtcaaag 


21960 


cttcacttag 


aaaggtgttg 


ttttattgcc 


22020 


actcctgatg 


aatctctaaa 


ggtggaattt 


22080 


ttaaggttgg 


tgcaaaagtt 


aattgtgatt 


22140 


ttacttttgc 


accaacctaa 


taaaatagta 


222 00 


gtggaaatgt 


gtgaccctgt 


gaaagggggt 


22260 


aacgactttt 


gaagagagtc 


tggcatacag 


22320 


ttagcagtat 


tattttcata 


ggaaagactg 


22380 


ttcaaaatag 


acatccacac 


ccatattggg 


22440 


aaagaggaac 


tagctgtgcg 


tctcagaaat 


22500 


ctgctagaag 


aggaatgcag 


atttaaactg 


22560 


ggtctagacc 


aaagaacttt 


gaggaacttg 


22620 


cagacattgc 


aggcctgaag 


aaaggtaggt 


22680 


tgatggcttg 


tttcacttcc 


tgggttaaga 


22740 


aggccacttc 


ccatgaccca 


agctggtatt 


22800 


aatgtgcatt 


cggtgggggg 


atgaggaaca 


22860 


agcctcctaa 


aggctgacat 


tggaggctgg 


22920 


agtcttcagg 


atatcaaaac 


actgaaagca 


22980 


ctactgttaa 


tgcccttctt 


ctgtggtctt 


23040 


aactcaggat 


ttaagaacag 


atgtatcaca 


23100 


caggcagggt 


ccccagcctt 


gtacatttag 


23160 


ggtctttggc 


actattcatg 


gggcacccag 


23220 


ctgctaatga 


aaaacacctt 


gtagactttt 


23280 


agtggtcgtg 


gccattgtaa 


tctttcccca 


23340 


ataatgtacc 


cagaatcaca 


caagttaaca 


23400 


caaaaacaga 


attcttggtt 


cctagcccgg 


23460 


cacttctgca 


tccccatcat 


ccttctaaaa 


23520 


ctagtctatt 


tgacaacttt 


gccaagaaaa 


23580 
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acggatggct gtttgtacca ctctcctcat 
aggatggtga cataaaaatg aggaaataag 
5 actggggcca atagaagagg gaagggtggt 

tgggggaaag acattgtatt tcttaagagc 
aggtttaaat tcttctccaa cctggaagaa 

10 

cagttaaaaa aaaaattgaa tatagagaag 
tcttggggac ttcatgtgcc gtagataatt 
15 gaggcagtta tatgaattta tggccagagt 
tgaccagaac ctgacctatg ttaccaaaga 
gctttatttt gatataattg ctctgccctc 

20 

ctaaacacag agagagaacc accagtgcgt 
tctcagtctg aaggtgtgcc tgagagattg 
25 taaagtacac atttagatgg actaagaaga 

gacttttgca atgtagaagg aacataacac 
ctgactaatc cacatgtggg caatggtggg 

30 

agagactata aaatgctgtt catgtgtgag 
gttcttacaa ggtggtcaca agaggggtgg 
35 agatgtctcg atataggaat gaaattacgt 

cttggagctt taaaaattaa aaggagagag 
tgaagtagcc gagcttttgt tatcaacatt 

40 

aaatataaaa gaaggctttt aaaggggatg 
ctggctacaa catcagtggt ggattgaata 
45 tcagaggcta taatgaatac ttctgataaa 

ggagaaatac acagagttga aaaatggttg 
agagggtagg caatggaagg aaccaatgac 

50 

cactgtgttg cccaggctgg agtaccctga 
cagcctcctg agtagctggg attacaggcg 
55 tttgtgtaga gatggggttt caccatgttc 
gcggtcctcc caccttggcc tcctatactg 
agctttaaac ttccagattt ttaacatgga 

60 

agaaaggaca cttgtaattt catagccctc 
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gctctctcct 


tgcaattgtg 


acccatgagg 


23640 


agactggtgg 


tgctgctgat 


ccacaaggtg 


23700 


ctgggtttcc 


ccacctcttt 


atcaggggtg 


23760 


ctgatgcata 


taggtgatgt 


ggcaaaacca 


23820 


atgtccctca 


tatgcatcca 


atattctcag 


23880 


tcaagccctg 


gatggtaaac 


acgtttttgt 


23940 


aacacaagga 


tttctgtggt 


ggaaaaagag 


24000 


ttcttagaat 


gacaaagcac 


cagccttgaa 


24060 


attatgggaa 


agcataggag 


tcagttacaa 


24120 


gaggaagctt 


acagtcttac 


cagagaggca 


24180 


ttgataagac 


agacccaaca 


tgtcctaaat 


24240 


aggcagaggt 


tcttttacat 


taaagaagaa 


24300 


atcatcttta 


gtcactttcc 


aaataacttt 


24360 


atttcttaaa 


cccttattta 


ccgtcatagg 


24420 


gaagtggggg 


aatagtgtct 


gagagaagat 


24480 


aagattagat 


gatgtggtgg 


gtttgttttt 


24540 


aacattcctg 


caaatggttt 


caatatatgc 


24600 


ctttggaaca 


acttaaataa 


gtcaaatata 


24660 


attcgaggta 


agactacaat 


agatttgtgc 


24720 


aaaaatttta 


actttatgct 


aagggtggtg 


24780 


gtgactcaat 


tgtttttgta 


tttgatcatt 


24840 


tccaaagcca 


caaatggaca 


cagggaacat 


24900 


aatagaataa 


tggcttgcct 


tgagaagatg 


24960 


tgtaggatta 


gaatgagatt 


gattgagcat 


25020 


aaactatcag 


atatttttga 


gacagggtct 


25080 


tgccattgtg 


gcttactgca 


gcctgggcct 


25140 


cccaccacca 


cacccagcta 


atttttgcat 


25200 


cccagtctga 


tctcaaactc 


ctgggctcca 


25260 


ctgggattac 


acrqattqaQt 


caaaatgccc 


25320 


tagtgtagtt 


aattgctgga 


ccttccacca 


25380 


ttagggttcc 


tgttcccttt 


agctcttgcc 


25440 
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actcttgatt 


agcaactact 


tacagggctc 


5 


ccaccttact 


tttttgaaag 


ggaaattttg 


tctaaggttt 


gaaataaaaa 


taacctcttc 




atttatgggc 


tttattagaa 


tgactttaaa 


10 


tataacaaaa 


tttaaaaata 


tggattaaat 




tagccatatg 


ctccaaatct 


aaattttgtg 


15 


taggtctgaa 


tatacctact 


ccatcgcata 




tcattggaac 


agagactgca 


agaggaagac 




ttgttatcta 


acttattcca 


aattgggtgg 


20 


tgctaagaaa 


at taattgta 


caagt ttttg 




tagtttagtg 


gatcccctta 


agcaaaggag 


25 


ttggccctgc 


cactgtttac 


tcaagttact 




gaataataga 


atatatctct 


caggggaatt 




aaaccagtgc 


gtgtcactta 


ataaatgcta 


30 


ctactatcac 


t tac tcatt t 


ctggc ttgaa 




ctgccactca 


aggacatgct 


agacagggac 


35 


tctccggtct 


tggggaaaat 


agaaaagcag 




atgtttttca 


tggtaccatg 


tggtcagagg 




caaagattgg 


tcagaagagt 


caggattgtc 


40 


aagac tatgc 


aoaQQCcrcrac 

a z)^z)z)^ziz)z)^ 


ctagaggc tc 




gataatggag 


tttatgaggt 


gggttacatt 


45 


tatatcgaca 


ctttagcatc 


ctctgagcag 




taaaataaaa 


tagcaggcct 


tattttttgt 




ttaaacatca 


gagaactgtc 


cccctctggg 


50 


taaattaaca 


atttaccaac 

W V* u. u w 


taccccaaaa 

W w "w 1*4. V* 




tagaagacct 


aaatagaaga 


tattcggcct 


55 


gtgttacagg 


aaaagtatgt 


ggaaattatt 




agtggcaaga 


actcaacgta 


acagaaaccc 




tcttggtgga 


aggcccagca 


attaactaca 


60 


tttgggcatg 


gtggctcact 


cttgtaatcc 



tgaaataatc tccacatagg aagatttgcg 25500 

aggaagactg gatgtcttct atttaggtct 25560 

tcccttctgg gacaagtagg taaattgtca 25620 

ttaagcccag agaaggacaa ttcccttaag 25680 

gtaagaccaa aaataagacc tattttattt 25740 

ttctgtctag gaagtgctaa agtgtatata 25800 

taaatgcaaa ccacctcaca aattccatta 25860 

aagggcctct tggcccacct ctacgtagtc 25920 

tttctacctt ttagctatta tagataatgc 25980 

tgcaaacaag agatcttcca ttcaccagac 26040 

ttctggaatt acatcacctg gatttaaatt 26100 

taatcttgtg tttctataat tctgtcctgt 26160 

taatgatgaa ataacacata taaaacattt 2 6220 

aatacatatt gactattatt actggtattc 26280 

gtataaaaca taaacttcct ttaacacttc 26340 

tttactcctg tcatgaccct gctccaacct 26400 

ataagagggg tctatataat tggtgagagc 26460 

ttgactcttc ctctcttttg cagtttgtct 26520 

ctcctttttt gctctcagta agttagatac 26580 

tcctcttcct cttgcagtct ctgttctaat 26640 

cacatgctat ggagtaggta cattcagacc 26700 

aacacaaaat ttagatatgg atcaaagggg 26760 

cttaaattta atttgtactc gcaaattttg 26820 

cttaatttaa agtcattcta ataaagccca 26880 

gggagaagag gtaattttta tttcaaacat 26940 

tcctcaaaat ttctctttca aaaaagtaag 27000 

tcagaaccct acaagcagcc tctaatcaag 2 7 060 

taagagggct atgggactgc aagtgtcctt 27120 

ttgtctgtgt taaaaatctg taagtttaag 27180 

cagcactact ggatgctgag acgggaagat 27240 
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cacttgagcc ctggagtttg agaccagctt 
aaaatacaaa aattagctgg gtgtggtggc 
5 ctgaggtcga ggctgcagtg agtcacgata 

atgagaccct gtctcaaaaa agtctggtag 
aacataggtt cttttctatt ttccctaagt 

10 

gtacaaaaga gagaaatttt acagctaggc 
accgtgatgc ccccgagctg caaaaccagc 
15 gaatgtacaa atagggagtg ggtcacagag 

acaagggcag aagggcagag caaggtcaca 
ggttccatgt cctgctgtgc acacattgtc 

20 

tcaagagcag agaaccagtc tgactacaat 
agcctgaggg tgctgcagga gaccagggtg 
25 gcagacaccc ccagagcggc tgtccatagg 
gttattcctt actgggaaaa gaattcagtg 
aaaaatatga ctttgttctg cccggccccg 

30 

tgtttcctga acatcgctgt tatcctgttc 
caaacacaca tgttttacaa tttgtgcagt 
35 cgtacatctt cagcttacga agatgacggg 

gaaattataa aagtattgat tggggaagtg 
attcagagat tgcagtaaag acaggcataa 

40 

aatgaatgtc catgaaatct tcacaattta 
tccgttcggg gtccctgact tcccgcaaca 
45 accctctatg ctcagtcaat cattaaatga 

gacttgaagt ttattttaca tgatctaagt 
tgcatggaac agtccatccc ttaactttca 

50 

ctcttacagc atctggttag atcttgtttt 
tgttcaaaga tttaattcat ttatatttaa 
55 tgacatttta ttattttggg ggggtttgtt 

tggtctacct ttgtgattta gtaattttat 
tgtgtatctg ctgtagttct ttgccttgta 

60 

gttttataaa ttgactattt taaggtgaca 
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gggtaacata 


gagaaacccc 


atctctacaa 


27300 


acaggcctgt 


aatcccagct 


acttgggtgg 


27360 


gtgccagtgt 


actccagcct 


gggcaacaga 


27420 


tttattggag 


gaaccagtcc 


ccaatatttc 


27480 


gtcagccagt 


ctgagaaata 


aagagaatga 


27540 


ctccgggggt 


gacatcacat 


atcagcaggc 


27600 


aagtttttat 


tagggatttc 


aaaagacgag 


27660 


atcacatgct 


tcaaatggca 


ataaaagatc 


27720 


aggccagggt 


gaaattagaa 


ttactaatga 


27780 


attgataaac 


atcttaacag 


gaaacagggt 


27840 


tcaccaggct 


ggaatttcct 


aatcctagca 


27900 


tatttcatcc 


cttatcttca 


atcgcataag 


27960 


caccccctgg 


gaatgcattc 


tcttcccagg 


28020 


atatttctcc 


tacttgtttt 


ctgcaataag 


28080 


caggcagtca 


gaccttatgg 


ttatctccct 


28140 


tttttcaggg 


tgcccagatt 


tcatattgtt 


28200 


taacgcaagc 


atcacagggt 


cgtgaggcaa 


28260 


attaagaaat 


taaagtaaag 


acaggcatag 


28320 


ataaatgtcc 


atgaaatctt 


cacaatttat 


28380 


gatattataa 


aagtattaat 


ttggggaact 


28440 


tgttcttcag 


ccatggcttc 


agccggtccc 


28500 


gtagtttgtt 


attaatgact 


tctattaccc 


28560 


tgaccttctt 


tgcctctttt 


gacagttttt 


28620 


acaggtaatt 


attctcttta 


ggctagtctt 


28680 


actatgtttc 


ttcttaaggc 


taaagtgagt 


28740 


ttaaaaattt 


tccagccgct 


ctttgacttt 


28800 


ggttagcatt 


aatggataag 


gatttactag 


28860 


tttgtagata 


tttgtttctt 


tcttgtgctc 


28920 


gaattgctaa 


gctttgaatt 


ctttaacgtt 


28980 


ctcatcatta 


ggcttacata 


aaaccacgta 


29040 


ataacaatgt 


ttgttgaaac 


aagaactcta 


29100 



-132- 



WO 02/062946 

gacttgtact gtcctcccac aaaactaaat 
aaaaccattg aagactgggt aatttattat 

5 

ctgggaagtc taaaggcatg catggtgtaa 
agtggaagtg agcgtacaaa gcagagagag 
10 agtccactcc tgtgataact aaccttctcc 

gtagaggtct cctgatgtaa ttacctcttg 
gcaattaagt tttcaaatga gttttagagg 

15 

accctccaaa actcatgttc ttctcacata 
caaaattttt acccattcca gcattgactc 
20 gcagatatgg atgagactca aggtatgatt 
cctgtgaaat caaaacaagt gatttgtctt 
agactttctc agttcaaaag aaataactag 

25 

agtccaaaac ccaaccaaga aaacagcatt 
nnnnnnnnnn nnnnnnnnrm caccttttct 
30 catgcaagga atggaacagg ccatgccagg 
catggctgtc atacattcac atctgtggaa 
acccaaagtc cttggggtaa gttgcaaatc 

35 

tatttcataa gacctaatgc caaatacact 
ccctcacgac gtcagtaatt acttttcagg 
40 gtcacaacat gcttccagtt gagttatatt 

ggttgtgggt catccatgat aattgtattt 
aggccaccta tgcactacta gctgacgccc 

45 

cacatattgc tactatttct ttctggcacc 
aatctatcct ctgagcattt ccatctggag 
50 ggattaaaag aatgacagaa tagggcaggt 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
cttcaaacca cggngaaaat gtcactgtgg 

55 

ttttctggag ggggggaaat atagttattt 
tactaggttt gtttgtttat ttatttttaa 
60 tcatccaggc tggagtgcag tggcacaatc 
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ttttgatatc 


tttgttttct 


gctgctgtaa 


29160 


aaaattgtat 


tttgcttata 


gctctagagg 


29220 


acctctagtg 


aggataatcc 


catgaaggaa 


29280 


agatggggcc 


aaatttattc 


ttttttcagg 


29340 


aaggataatg 


acattcatcc 


attcatgcag 


29400 


aaggtcccac 


ctcttaatac 


tgttaaaatg 


29460 


ggacattgaa 


accatagcat 


ctcatctaca 


29520 


caaaatacaa 


tacttctgtc 


tcaatagcct 


29580 


taaagttcta 


agtccagagt 


cacatcagaa 


29640 


tatctggagg 


caaatgtttc 


cagctgtgag 


29700 


caaaatataa 


tggtgaaaca 


ggtgcagaat 


29760 


gcaagaaaaa 


ggtgtaagtg 


gtttcaagtg 


29820 


aagctttann 


nnnnnnnnnn 


nnnnnnnnnn 


29880 


gctgcccacc 


ttttctgctg 


ccatgacaac 


29940 


ggctggccct 


ggtgtgcccc 


agctgggaaa 


30000 


aggattgcaa 


gagaagttct 


tgaagggaga 


30060 


taggggaaga 


ccaatggtgt 


tgcaaactca 


30120 


gtcccactag 


cctcataata 


caacagccaa 


30180 


ctcttcatct 


gaaaaactat 


atagtattct 


30240 


taaaatatat 


tgagttcaga 


tctattctag 


30300 


aaactcagtg 


tggaaaagca 


gcaaaataaa 


30360 


cctttacagg 


ccttcctgat 


agctgacagg 


30420 


ttttcttagc 


cccctgctca 


gaggatgaaa 


30480 


tctcagatga 


gcatgcccaa 


aagaaatgaa 


30540 


gaaaggatat 


nnnnnnnnnn 


nnnnnnnnnn 


30600 


nnnntaaatt 


aggtttatca 


ctagcccaag 


30660 


atattctgga 


tccttttttc 


acattattca 


30720 


ctcataaaat 


atgttattta 


tgttggcaca 


30780 


ttttattttt 


tgagatggag 


tctcgctcca 


30840 


tgggctcact 


gcaacctccg 


tcttccgggt 


30900 
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10 



tcaagagatt 


ctcctgcctc 


agcctcctga 


acctggctaa 


tttttgtatt 


tttagtagag 


cttgaactcc 


tgacctcgtg 


atctgcctgc 


atgtatgaat 


aattatttta 


aaatatactg 


aaaat tgaga 


gcaaagtaca 


gatctatccc 


tttttaatta 


tcaacctcct 


ccaccagcat 


tgacacatca 


ttactaaaag 


tacacaaatc 



15 tctgtgggtt ttaacacaag tatagtgata 

cccaccaatc cctggcaacc acaaatcttt 
agtcatacag ttggaattgt acaataggtg 

20 

tacatgcatt taaatttcct ccatgttttc 
actgaataat atttcattgt ctagaagcac 
25 ggacatcttg atggcttccc aactttggca 
gtgcaggtca tgtgtggaca taagtttttc 
tgtaagtaag agtacgttta gttttgtaag 

30 

gtcactttga agatttcaat caacagtcaa 
caattggtag tgccagtgtt ctggattttt 
35 ctatcatgtt aatttgcatt tcccttgtga 

tctttaccat ctctatatct tctttcgtaa 
taatgaggac gtttatgttc ttattgttaa 

40 

gatcctttgt cagatatgac tcttgcaaat 
atctcttgac aatgtcttct gcagagcaga 
45 ttattctgta tttcatggtt tgtgcctttg 

caaggtcatc tagattttgg cctatgttat 
tatttatgtc tgtgattcat tttcagttaa 

50 

ttattcttac ttatggatat caagttttaa 
ttgtattact ttagctcatt accaaagatc 
55 gctttctact ctattccatt atctatttgt 

attactatca ctttatagta tgtcctaaaa 
tcttcttctt cttcttcttc ttcttcttct 

60 

ttgtgtttgc tattctgagt tttttgccat 
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gtagttggga 


ttacaggcat 


gagccatcac 


30960 


acggggtttc 


agcatgttgg 


tcaggctggt 


31020 


ttcagcctcc 


caaagtgggt 


ttatttttta 


31080 


tatgttttag 


agcagtttta 


ggcttaaaga 


31140 


atatgcccct 


tgcccccaca 


aatgcatagc 


31200 


gtaacctttg 


taccaattga 


tgaactacat 


31260 


acaatggggt 


tcattctctg 


tgttgtgtgt 


31320 


tgaatctacc 


tttgtagtat 


cacatagtat 


31380 


tattgtcttc 


atagtttgcc 


ttttccagaa 


31440 


ttttttttag 


attgcgttct 


ttcacctaaa 


31500 


ttatggcttg 


atagctcatt 


tatttttagt 


31560 


cacagtttat 


ttatgcattc 


acctactgaa 


31620 


attaggagta 


acattgctat 


aaatatctgt 


31680 


aactcatttg 


gttaagtatg 


attagtgtat 


31740 


aaactgccat 


actgtcttcc 


acatggcctt 


31800 


tgagttcctg 


ctgttctgca 


aaatctccag 


31860 


accattctaa 


caggtgctag 


tggaatctca 


31920 


cataagatgt 


ggaccatctt 


ttcatgcgtt 


31980 


ggtgtcttgt 


taaattcttt 


ggcacatttt 


32040 


gttttaaatt 


tgttgtatat 


tttgggtaac 


32100 


atcttctccc 


agtctgttgt 


ttgtcttttc 


32160 


actttttaat 


tttaagaaaa 


ttaagatgat 


32220 


gtatcgtatt 


gacaaattca 


tcaccaaacc 


32280 


ctgttaggag 


ttttatagtt 


tttagtttaa 


32340 


tttttgtgaa 


agatataagg 


tatgtgtctt 


32400 


gtactatttt 


ttaaaatact 


ttttttttca 


32460 


acttgactat 


atttgtgtgg 


ctctctccag 


32520 


ctatttcttt 


gctaatacca 


cacagtcctg 


32580 


ttgggtagtg 


ttagtcctac 


aactactact 


32640 


tcttcttctt 


cttcttcttc 


ttcttctaga 


32700 


tccttataaa 


ttttagagtc 


agtttatcaa 


32760 
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15 



tatatacaaa 


ataacttgct 


gggattttaa 


tggaaagaac 


tgacatcttg 


acaatgttga 


tccatttatt 


ttatcaaagt 


ttgtagtttc 


tatacctaac 


aaatgccaaa 


taccaatacc 


aaatggcatt 


atgtttttaa 


ttccaaattt 


aattgtctct 


tgtgtattaa 


tcttgtataa 


gttccaggaa 


tttttggttg 


attctttcag 


aacaaaggca 


gttttatttt 


tccctgacca 


ttactgtttc 


tgctatgact 


tccagtacaa 


ctgccttgtt 


cctgatctta 


gtgagaaagc 


ctgtaggttt 


ttcacaaata 


ttctttatca 



ttttcaagag tttttataat aaacaggcat 

25 



gttaatataa 


tcatgtgatt 


tttcttcttt 


gttttttgaa 


tgtatagtca 


gtcttgcata 


acaatttttt 


tgatgcattg 


ttagatttag 


tatttgttca 


tcagagatgt 


tggtctgtag 


ttttggtaga 


tggtagtgct 


agcttcagag 


attttctgta 


aataatggta 


taattttttt 


tccttctagg 


tctgatgctt 


tctgttttgg 


aataattata 


ggcttattca 


gagtgtctct 


aggagttatc 


cattttatca 


aggctatcaa 



ttttattatc cttttaattt ctatgggaac 

45 

agtaattttt gtcctctctc tttttcttag 
tacatacata tatttttaaa aactgctttg 

50 gttttcaatt ttatgatttc tgctctaatt 

agtttaattt gcacttcttt ttctagtttc 
gatcatgatt ttctaatata ttcattcaat 
actacatccc ataaattttg ataagttgtt 
ggataatggt actcttactc tgctacgttt 

60 ataccccaga cttcaagaaa atatcatatt 



tggagattgt attaaatcta ctgatcaagt 32820 

gtcttcctat ccatggacat ggaatatctc 32880 

ttatagattt tgtacatttt tttttagatt 32940 

taaatatttt atttttagag gtgctaattt 33000 

cacttattca ttataagtac ataagaaagt 3 3 060 

atcttgtaaa cctaatataa ttgtttgtta 33120 

attttctaca cagacatcat gtcgtctggg 33180 

gtctgtatat ctttatttcc ttttcttgtc 33240 

tgttgaaagg agtagtgaca gggctcatcc 33300 

tttgagtttc tccattaagt atgaagttag 333 60 

acctgaagaa gttctcctct attaataggt 33420 

tggattttgt tcaatatttt ttctgcgtct 33480 

agcctgttga tgtgatgaat acattaatcg 33540 

cctgggataa atcccaagtg gtcatggtac 33600 

tttgcagata ttttgttgag gatttttgca 33660 

ttttcttttc ttgtaatgtt tttgaatgat 33720 

aattagttaa ggagtatgtc ctctgtttct 33780 

cttaagtgtc tggtagaatt caccactgag 33840 

aaatttatta attattgatt caatttattt 33900 

tcttgtgtga gttttgacaa attgtcttta 33960 

actgtgggcg tagaattgtt cataacatta 34020 

tgtattgatg ttccttcttt catttatgtt 34080 

gctggctaga ggctttttaa attttattta 34140 

gattttattg attttgttta ttcatgttct 34200 

tctattatag ctcttcttct gtttatttgg 34260 

ttaaggtaga ctctcaggag attgatttta 34320 

gctatatatc ttcctctaag cattgctttc 34380 

gaaatgtttt tgatcctttg gaatgccatg 34440 

agatccctgt gtactcatac ttgtgtcctc 345 00 

tgcttttctt gttttcttcc cttgtatata 34560 
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ctctcctttc ttcaggttac ttctccttgc 
cttctctgag ctcatgagaa aaaaagagaa 
5 tcagaaaagc acctctgggc tggttctggc 

atgatagacg tcgcgagaga taatattggt 
aaagagtgat aagagaatac taagaaggtc 

10 

tagcttagtg attggaaagg gggtggcgga 
ttgctttctt caccattttt aggttgtgca 
15 gggaataaca atgatgtgta tggcatctaa 

tatcgggtac acaatttggg ggtcagtaat 
agataacata aagcagttgt caataccctg 

20 

gtaattctgt aaatctgaga gtggtatcta 
aagtactaca aataatatag tattatatta 
25 cagttagata tattacagca gcacattgcc 
ggaagcactg attatttcac aacaagacta 
cagtggtgca aggatttcat caaggattgc 

30 

acacatccaa cttttggtaa atatttcata 
ataacggact tgccattatt ggcttggact 
35 gtctacccta cctttagctc atgagaaact 
gaagaggaga tgtggctggt acacaagcag 
actcaggaaa ttgttacctc agaaaatgaa 

40 

ttcaataaaa taactaatta ttataattaa 
tacattaaaa ttagaattgg aatttaaatt 
45 atagttttaa tgcaaattga gagtaaaatt 

agagagaaaa ataaaattta taagttgggt 
attgcagggt ttccatttat ttctaaaact 

50 

gagcttgctt tttgatcccc ttcacctgtt 
taaagctgtg ttttacttct agaaagctgc 
55 gccaataaca gctcttatta aactcttact 

atgtgcaatt tcatttcaaa caatcaccac 
atggatgaag agaggttaag taactaaccc 

60 

gattcaaata gggtcaatgt attttcagaa 



PCT/US01/48437 



tacaataatt 


gctttataat 


tcctaggcca 


34620 


ataattgtgg 


tagaagtaac 


atcgtaggta 


34680 


atactccttt 


taaccttgaa 


attattggca 


34740 


aagattacat 


tcacggttat 


ttatgtgaaa 


34800 


atatgactat 


caaatgaagg 


agaataagga 


34860 


agcaccattt 


ttgtggtgga 


caatcagttt 


34920 


gattctgact 


gccctgatga 


gccttagcat 


34980 


tacttatgga 


agtaacccta 


tttccgtgta 


35040 


ggtgagtaga 


gtatcttttg 


atataattga 


35100 


atctttggaa 


tcctattcta 


accgtatctt 


35160 


accatttctt 


actcaatttt 


ctcatattta 


35220 


aatctaggca 


gattaaatat 


ttatatatgt 


35280 


aaagacccaa 


accagaatgg 


cttaaacatt 


35340 


gtggcttatc 


atttctagga 


tggcttcatt 


35400 


tagcagacca 


catcagaggg 


gagatggttg 


35460 


tacatattcc 


caaatcaact 


gtcagtaagg 


35520 


aatcaagatt 


tacctctgta 


gctgggagga 


35580 


gaacaaaatg 


atggtttagt 


tagcaaggaa 


35640 


tttctaatat 


ctagtctact 


gactatcgaa 


35700 


aatgttattt 


gatatatttt 


gtttcataaa 


35760 


tattaaatta 


attataacta 


ttattaatat 


35820 


taaaaacctt 


ctcccaagag 


aaaacacaag 


35880 


gagaataaaa 


taccaagttt 


tatccattga 


35940 


ttatgttaga 


atcagggaat 


ctattctagc 


36000 


gagaatgagg 


tagtgtagag 


ttttatagaa 


36060 


tacagcagaa 


gtgcaagatt 


ggaagaacag 


36120 


acaatacttc 


atatatccaa 


aaatactgat 


36180 


atcatgtgga 


cactctactt 


agtgctttat 


36240 


gaacttctga 


tttttcatta 


taatcatttt 


36300 


aacatcacaa 


ctgcttggta 


caggaatgaa 


36360 


ttgcatacat 


ctaactactt 


agttttctgt 


36420 
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ttttatttcc gtttcctttt caagatcaca 
ccacacatgt tcagtagatt accaggaact 

5 

tgagactttc gtaggtgttc cattagatga 
cggctacctt ccattattct ttcaaccttt 
10 tttttctctt tcatacagta taagtaccaa 

ttttggcctc ttcagttctg tttctgtata 
atagtacata catacttttt cctgtggcaa 

15 

aatacacaga catacataca cacttgtaat 
ttccaaggtg ccttattgaa tttttagaat 
20 gaatgtttgc aatatatatt gttcatactt 
ctatttttct tatgtaatta ttttggtaaa 
gcatctttat cttgattttg actttaatga 

25 

ataaagactt ttagactgag acagatttct 
tttttttttt tttggttttt ggtttttggt 
30 caggctggag tgcaatggca cgatctcggc 
ccattctcct gcctcagcct cccaagtagc 
gctaattttt ttgtattttt agtacagacg 

35 

tcgtgatcca cccgcctcag cctcccaaag 
cctgccaata attctttata ttttaccagt 
40 tattaaatgt cctagcatat caacactgta 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnntgt ttgtagagct 

45 

tttatatcat atgatgggtg attccctcta 
catgaatcat ctctatgtct gggtatttgg 
50 tagttctgta ttttttatat tatgatataa 

agcaaattaa aaactccaac aattgaatat 
agttatttta gctaaatgaa aaataattgc 

55 

tctagtcatc atctaattgg gaatttacat 
aaaatattgg ccgggcgcag tggctcacgc 
60 cgggcggatc acgaggtcag gagatcaaga 



ttaaaaacca 


gcttacatgg 


tgcaagcctt 


36480 


gcaggagcca 


tttgtccgct 


gagacatcac 


36540 


gcctttactt 


aattggtgaa 


ctcttttctg 


36600 


ttttccaaac 


ctcttctatg 


aagtgcctcc 


36660 


taccaagaac 


cagtcacaag 


ttcctcttat 


36720 


tgtttcacat 


tccttgtttt 


ttcctctgag 


36780 


atgaagtcct 


cttattcaat 


cccatcttca 


3684 0 


gtgtatatat 


aatttctgta 


tagaatgttt 


36900 


agtttatggt 


aatttttagt 


tcacgttctt 


36960 


aaataagaat 


agttaaacca 


cttgttccaa 


37020 


tatacccaga 


atagtattac 


ataatagtca 


37080 


aatgatccta 


catttttatc 


attaaacata 


37140 


atcatgttac 


agaaattaca 


attctttttt 


37200 


tttttgagac 


agagtttcgc 


tcttattacc 


37260 


tcactgcaac 


ctccgcctcc 


cgggttctca 


37320 


tggaactaca 


ggcacccgcc 


accacacccg 


37380 


gggtttcact 


gtggtctcga 


tctcctggtc 


37440 


tgctgggatt 


acaagcgtgt 


gccaccacgc 


37500 


aatgttgcag 


ttactaataa 


tgttgaattg 


37560 


aacatgaata 


gcaataagcc 


tgtnnnnnnn 


37620 




nnnnnnnnnn 


nnnnnnnnnn 


37680 


gaaggaacat 


cccaaatgtt 


catgatgtat 


37740 


gtggttttat 


tccagtggtg 


atttagatat 


37800 


aagttaaata 


aatttattga 


gcttttttat 


37860 


tctttgaatc 


atctgaaatc 


aaatttgaaa 


37920 


taaaagatag 


agagtctaac 


caacttgact 


37980 


ctattgatgg 


aatcaaatga 


actactatat 


38040 


ttttttgcta 


ctattttgct 


tataagttta 


38100 


ctgtaatccc 


agcactttgg 


gaggccgagg 


38160 


ccatcctggc 


taacacggtg 


aaaccccgtc 


38220 
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tctactaaaa acacaaaaaa ttagcccagc gtagtggcgg gctcctgtag tcccagctac 38280 

tcgggaggct gaggcaggag aatggcgtga acccgggagg cggagcttgc agtgagccga 3 834 0 

gattgcccca ctgcactcca gcctgggcga cagagcgaga ctccgtctca aaaaaaaaaa 38400 

aaaaaaaaaa aannnaattc taaatgagaa taattttagt gtattttttg tttaaatata 38460 

gtcatcaatt ttggtaccat gtaagaatta ctttgtatca ggagtttgga aactatcttt 38520 

tagtgccctc aaacagtttt aatatcattt cagttagata tttgaaaaac ttgataaatt 3 8580 

ctcccatgga tttatctggt attatttctc tgttctttct gggaaaaggg aaaataatgt 38640 

15 gcaaaccttt gtggttttct ttaagataat tggtctgatt aacatatcta tctaatttag 38700 

aattaagttt agtaagttat gactcccaat actggcttaa agttttttac cttcattttt 38760 

tctctcattt ttagtttatt atttcaggat ccttgtcaat tgcagcagga attagaacta 38820 

20 

caaaaggcct ggtgagtaat attttctttt tttggtatca aaaaaaggaa ggattaataa 38880 

aaaatgtatt ctgccagggt gtaaaacagg ttttgtttct ttgtttgtaa aaaatcagga 38940 

25 gcgattgtgc attttaccta ctagaatctg tctttcaaaa actgcagtca ggctaattgc 39000 

tgacatagga ctggatattt tggcaaggta gggaatctta cgcttgtgga gggaatcttt 39060 

ttgaagatat agagaattat tgagaagaaa tatcaagtgg gaccgctcca acaaaaccct 39120 

30 

gcagcatgag tcagcccatg atagtgacac caagcattgg tgttgtgtaa tatttactga 39180 

cccattgatt ataatgtaga ctccaactgt gtccagaaag gattttgagt tttcaataaa 39240 

35 agaaattcac aataaggtta ttacataaga agcaaattta aaatgtaatt ctcatgccaa 39300 

gatgcctttc ataaagccaa tacaatgcga aagtccatat ctgaataaat gtttttttct 39360 

agttaaagtc catatttgag ttttctttgt tcaaaagata aatatcaaat tctggagcat 3942 0 

40 

gatcaatatg ggttttccaa atctggccta acccaactcc ctagtgtccc ctccggctgc 39480 

agccacatta ctcatactcc tcacactttg ctgtgcaagt ttagttcttt tttgtttgtt 39540 

45 ttttttctgg tttttggttt ttggtttttt gagacagagt ttcactctta ttacccaggc 39600 

tggagtgcaa tggcgtgatc tcggctcacc gcaacctccg cttctcaggt tcaagtgatt 39660 

ctcctgcctt agcctcccga gtagctggga tcacaggcat gtgccaccac gcccagctaa 39720 

50 

tttttgtatt tttagtagag atagggtttc tccatgttgg tcagactggt ctcaaactcc 39780 

tgacctcaga ttatccaccc cgcttcggcc tcccaaaatg ctgggattac aggcatgagc 39840 

55 cactgcgccc agtcagcaat tttagttctt attaaaagcc tccttacctt tatccatgag 39900 

aagagaacgt acagaaaaga atatactgct gagagagata atatacaaaa ggtatagagg 3 9960 

catgacaggc aaacagcatt tagaaaaatg ataagaaatc aggcatcaga agcaaggagt 40020 

ggcagggaag gacacagaca gggctgtcca gctaaagtgt gaaggacatt gaaaaccctg 40080 



60 
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ctaaagagat ttggagatta tattatgtgc 
cggaataatg agatcatgtt tgtgttttgc 

5 

gttagaagac tattgaaata atcaagataa 
gtgtaaagtg attcatggag atatatctga 
10 attcctttct aggtccgagg tagtctagga 

tcagggatct taatcaacac atttagcttg 
aactactatg gcaactcaaa taattgtcat 

15 

gcctcttttc aggggatatt attatagaag 
acaataatta atattcccta attgatgaat 
20 agaataatct ttcatttagg agttaggtac 

tatcatgagg caaaatggga agaaagctta 
actgtgctgg gtattttgta tacatttttc 

25 

tactgttatc ccctctaaaa atggggccct 
taaaattgtc ttgaacagta ttttacaagt 
30 tttggatgtt cttgcaaggt taaccacttt 
cagtcttcat tatgactttt actacttttt 
ctaattgctt tattctgaaa atgccaccta 

35 

gatattttgg gccattgggt aattgtgggg 
actaaaatgc acaattaaat ctttattatt 
40 cctgtaatcc cagcactttg ggaagatgag 

agaccagcct ggccaacatg atgaagcccc 
gcatgatggc acgtgcctgt gatcccagct 

45 

gaagctggga ggcggaggtt gcagtgagcc 
aacagagcga gactgcatct caaaaaaaag 
50 caaatttttt gttactttcc ctctatatga 

ctcttcctta ccacgttctt cgcttatgta 
actaagcaga atcagttgca ttctgtttac 

55 

tgggctatat aatcacacca aaataacccc 
cccttgagat ttctattgct ttttatagta 
60 ataagaaaaa gcttctatag agaatttgac 
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ccacgggaat 


catctgaaga 


attgaagctg 


40140 


caagagatta 


gagttgaaac 


cacaagacaa 


40200 


tgatagatgg 


cagtaagaaa 


gagatggtgg 


40260 


ggatgttgtt 


ggattgttta 


attgattttc 


40320 


atgaatatca 


ccagctctgt 


actggctgca 


40380 


gcgttttatt 


cattccatca 


cccttactgt 


40440 


gggactatgt 


ccatcttaat 


ggttggtact 


40500 


caagtttggg 


gagtgctggc 


tctggcaaag 


40560 


gactttggga 


agttggagtg 


attgagaaat 


40620 


atgtctcatc 


tctgtcttta 


ttcttctata 


40680 


tagttataga 


atacctactc 


attaatagtt 


40740 


tgtttaattt 


taaaacattg 


atgcagtacg 


40800 


gatgtcagtc 


cctattctct 


aataatgaag 


40860 


ggtgagtagc 


tcagtatcct 


ttccaacatc 


40920 


tatcctcaac 


atctagctcc 


tgataaatct 


4098.0 


tgtaccaatt 


gttcttatct 


taatgctatc 


41040 


cacaatattt 


tgtaggtagt 


aaaacttctt 


41100 


aggaaaagtt 


attgcagcac 


atcacagcag 


41160 


aaaaagcctg 


gcagggcgca 


gtggctcacg 


41220 


gtgggtgggt 


cacctgaggt 


caggagttca 


41280 


gtctctacta 


aaaatacaaa 


aattagccag 


41340 


actcgggggg 


ctgaggcagg 


agaattgctt 


41400 


gagatggaca 


cattgcactc 


cagcctgggc 


41460 


agaaagccta 


ttggtcattt 


tgaacacctg 


41520 


actttcagcc 


tttcaccttg 


ctgcttagtt 


41580 


aacgtggaaa 


cccaacattc 


ttcctaactt 


41640 


ttcattccta 


gctaattcta 


catagttttg 


41700 


ttgaggtgtc 


tttgctaaac 


taatttactt 


41760 


agcaatttta 


cttcttgcca 


ctcgctcaaa 


41820 


aaattctttc 


aaggtctttc 


agttagaaac 


41880 
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tggcaaaagg 


aagattagat 


tctacaattt 


cttgtctcta 


attcccctat 


tatttgctat 


41940 


ttgtatcctg 


cagaaggacc 


ttacttttag 


tttacgaagt 


tctcttcatt 


ggtagatttt 


42000 


aagggcacaa 


tgatagagac 


aaaatctgga 


attggccaac 


ttgatagggc 


tgatttcgtg 


42060 


ggcatgggac 


ctgtgcagtt 


actcaggact 


tcacagttaa 


aagagccctg 


tgcttgatct 


42120 


aatgctctgc 


tgttgctatc 


ttaaaattgt 


ttttaaaaat 


tttaaacaag 


agaatctaaa 


42180 


ttacatagct 


ggtcctccat 


tcatctcaat 


cacctttccc 


agagtccatg 


atgaattcct 


42240 


gttaattgtg 


tgaacatctt 


tatgcctgtt 


cttatagcag 


aggacctcca 


taagggcttc 


42300 


ctgggatttg 


aagtaatggt 


tctattcttg 


cctgggcacc 


tgcacaatca 


gtccacaggt 


42360 


ctatatacat 


gctacctttt 


tttggaggcc 


cagccctgta 


cctgtttttg 


taaataaagt 


42420 


ttcataaata 


aagtttaatt 


gaaatacaac 


gaagcccatt 


tgtttacata 


ttgtggagtg 


42480 


ctgctttcgt 


gttataacag 


caggtttgag 


tatgtgtgac 


agagacctta 


tggtcggcaa 


42540 


aatctaaaat 


gtacattatc 


tggcctttac 


agaaagcttg 


ccgtcttcta 


gtctacactt 


42600 


ctagatggta 


aaatccttga 


agacagggga 


attgtctgtt 


ttgtttgctg 


ctctatcttc 


42660 


tgtgcctaca 


acaatgttgg 


aaacacagag 


gccactacag 


ctgttgaagg 


cacaagaaaa 


42720 


taaataagca 


atgaatgcaa 


atattctcaa 


ggtctcctta 


agtctgcctc 


actcatatcc 


42780 


tgagctcttg 


gtctattggc 


tcttctgcaa 


cagagggact 


atccctaacc 


atggagagaa 


42840 


aaagagagaa 


tatttaaacc 


agataatccc 


tgcatcaatc 


cgggggaaaa 


ttaatagaaa 


42900 


taagtgtact 


gcccaatgtt 


ttccaaattt 


tgaccatgtt 


tcttggctga 


agacnnnnnn 


42960 


nnnnnnnnnn 


nnnnnnnngt 


ctggatggca 


tggtgctcct 


cttaagtgtg 


ctggaattct 


43020 


gcattgctgt 


gtccctctct 


gcctttggat 


gtaaagtgct 


ctgttgtacc 


cctggtgggn 


43080 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


gaattcagaa 


43140 


aagcagtggt 


tttggggcat 


atgatagcta 


ggtctctatc 


tcttcctgtt 


cttccaccaa 


43200 


ccctatgtca 


cactgacttt 


gtggaagaat 


aaccagaact 


cttcttagtt 


ttgtgatgtg 


43260 


tgggccagtg 


tggtttttgc 


aatgtattat 


tcataactag 


caaatcatga 


tttgggtatg 


43320 


ttgatgtcat 


cagcatgaac 


aacctggttt 


ggaagaagag 


ccccatattt 


ttcataacac 


43380 


cttggtttac 


ccaacgtact 


gtgcactgtc 


atccacacgt 


aatattgatc 


cacttgcgag 


43440 


accagagcag 


tggtcatcag 


cagctcaggt 


caagagctat 


cactgaatgt 


tgtgtcctaa 


43500 


atcaggcctg 


atcaagagag 


tgactgagaa 


accccacata 


cttgatcttg 


agaatgaatc 


43560 


agagaagaaa 


agagtaaact 


ctgataatga 


tgactttatg 


tgggcaggaa 


aggggcactc 


43620 


tgaggtgatt 


tgggtgactc 


acctttggca 


ttctgattgt 


ttcacaggtt 


gtgttaattc 


43680 


tgccatcaca 


ttctcacatg 


gcagaaacag 


catctcccac 


accacttaat 


gaggtttgag 


43740 
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gccaccaaaa gatcaacaga caaatgctcc agaaatctat gctgactgtg acacaagagc 43800 

ctcacatgag aaattaccag tatccaactt cgatactgat agacttgttg atattattat 43860 

5 

tatatgtaat ccaattatga actgtgtgtg tatagagaga taataaattc aaaattatgt 43920 

tctcattttt ttccctggaa ctcaataact catttcactg gctctttatc gagagtacta 43980 

10 gaagttaaat taataaataa tgcatttaat gaggcaacag cacttgaaag tttttcattc 44040 

atcataagaa ctttatataa aggcattaca ttggcaaata aggtttggaa gcagaagagc 44100 

aaaaaaaaga tattgttaaa atgaggcctc catgcaaaac acatacttcc ctcccattta 44160 

15 

tttaactttt tttttctcct acctatgggg accaaagtgc tttttccttc aggaagtgga 4422 0 

gatgcatggc catctccccc tccctttttc cttctcctgc ttttctttcc ccatagaaag 44280 

20 taccttgaag tagcacagtc cgtccttgca tgtgcacgag ctatcatttg agtaaaagta 44340 

tacatggagt aaaaatcata ttaagcatca gattcaactt atattttcta tttcatcttc 44400 

ttcctttccc ttctcccacc ttctactggg cataattata tcttaatcat atatggaaat 44460 

25 

gtgcaacata tggtatttgt taaatacgtt tgtttttatt gcagagcaaa aataaatcaa 4452 0 

attagaagca atactttcat gtgctaccta cttccttttc agatgcaata tggtggaaag 44580 

30 agactaagcg ttgatgtcaa aaatacagta tggtggtaaa taatgtgggt tttctaaaca 44640 

gacaaccctg ggttttcatc ttgctatccc atctttgttg tttactactt atgtagctat 44700 

gaagatgcca caaaatttct ctgtgcttca tatttctttt taggaaggta gtaataatat 44760 

35 

tgtctggccc atacacctat gataattact tggtataaca aaggtagaaa aaccaggact 4482 0 

ctgactggta catatggagc cacgaaagtt ttaattaata tatatacatg tatatatgta 44880 

40 aatagatatg tattagatat atatatcaat ttatcatttt attcagcata tatctgtaaa 44940 

gtatctgtaa tgaggcagac attgttttaa gcatataaga aaatagtggt aagagagata 45000 

atatcttata tatataagat aatagtggtg aataagagag agaacatcct tcctctcttg 45060 

45 

taggttaaaa cttttatggg acaggacata gattaacata taaacaatca aaaccttttt 4512 0 

atatgttgtt agaagctaac tggaagaatt aatgtgctag agagtggctg ctgtagtggt 4518 0 

50 ggtgagagat gagtcattag aatgtaccca ggaaagtttt ttttttgagg aagtgatatt 45240 

ttcattatga taagataaaa attattgaat ttgcacggat attcgtattt tccaattcag 45300 

tatatagtat ttctagattt ctcattctgc cctacactca catatctatt gtatccacct 45360 

55 

ttctccatgc acgcttttca tgggaatgta agtccctcct tctatcagtg ctaacctgtc 4542 0 

tactttctcc agctccaggg atggataagg tgtggtaaaa gtcagttaag acattctatc 45480 

60 ttcttggttg aaagttggaa ggtaattagc aagaaccacc aaatagaact taatactatt 4554 0 
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gcttttcttt cttttttttt ttgaattttt ttttxttttt tttttgagac ggagtctcgc 45600 

tctgtcgccc aggctggagt gcagtggcgc tatgtcggct cactgcaagc tctgcctccc 4 5660 

cggttcacgc cattctcctg cctcagcctc ccgagtagct gggactgtat acaagcgccc 45720 

gccaccgcgc ccggctaatt tttuatattt ttagtagaga cggggtttca ccgtgttagc 4 5780 

caggatggtg tctcgatctc ctgacctcgt gatccgcccg cctcggcctc ccaaagtgct 45840 

gggattacag gcgtgagccc ccgcgcccgg ccaactgtta atattttacc atgtcataaa 45900 

gagatattgg tctatgtatg tacctaattc caatctttta attattgtgt atgtactgat 45960 

15 atcctacttg ttttatttct tcaatattgt attaatactt ggcagatttt gtactcggca 4602 0 

tttattcccc atttgttctg ataatgatgg cctgatttcc tccaaaaaat caccattatc 46080 

taactcttaa tttatgtaca agggtggagc tcacccaacc cccaatttca ggagttgacc 4614 0 

20 

tgtgtcctac atctgggaag tggcagaatt acatcaactc agctagagtg aagttctttg 46200 

gtagccttgt gacctaggaa aaaacaattg tgtgaatgac tgacacattt ccaatcaaaa 46260 

25 ttaatctcat cattttcact tcactgacca agaaatggtc tccccatctt tcctctttcc 46320 

tgttgggttt ggagctgtga ggacgtaagt ttgctgccat cagcagcccc gtgacagaca 463 8 0 

gatgctcaca gacaaaagca gaactgagaa gttccaagtg aacgatcaga ccctaatgta 46440 

30 

attcttttga gtctccaagt caagttttac ccggagttgg cccaacccct ggggttttaa 46500 

atatataaag acaaggaaag agatttattc agatagtgca ataactgggc aaaatccttc 46560 

35 aacggcttcc tctctaattc agaataaaag ccaaagctca tataaatcct acatagtcct 4662 0 

tacatcacgt gattctttgt tgtttcttgg atttttaaaa tatcctgact tatatcctga 46680 

tatattgctt tgagattgtt taaaatggta agagagtgcc ttctcttggt gataattagt 4674 0 

40 

gcatacataa tttttttaac acttacataa gcaacctaag attagtgcat ttttagttgt 46800 

aaaaagttca tgcaaatatt ttgtgctatt agtgcaagat attatgattc caacttttag 46860 

45 acaagactac ctgcacttgt ttcatgtcat gtgatttgaa atttattctt catttggtta 4 6 920 

ttcttggttt tatctatatg tgattagaaa gcagtaggaa aatgtccatg agactttgtc 46980 

tccttttgct acttttacaa taaaaataat tattgtttca taggtttttg tttttttttt 47040 

50 

tgagagaggg tctggctctg ttgcacaggc tggagtgcat tagaacctct gcctcctgga 47100 

cttaagccat cctcccacct cagcctccta agtaaatggg actacaggtg tgcaccacca 47160 

55 cacccagcta ctttttctat tttttgtaga gacagtgttt caccatgttg cccagtttga 47220 

ttccctgtct gcttaatgat tgcaagaggc tcccctgctg ggttgaagtt gggccaagaa 47280 

ttgccttagt agtggttgat acggttggga tggtatggtt aaagttgaac tcatcattga 47340 

gggaagcatc ttgggaccgc acttcgcctt gtaaataatt ggaaaaaaaa agttataagg 47400 



60 
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tcttgagtat 


ataaagtgat 


attattgata 


5 


aggtagatgc 


aagatatctg 


tgactaaaaa 


ttttgggttt 


tttttttgtc 


ttagactcac 




aaatttgtgt 


tatcaaatat 


ttctgtcccc 


10 


taccagtcct 


aaatatcaaa 


gattataaga 




ttgaggcttg 


tgtatttttt 


cagtcaatga 


15 


tggtaagtgt 


actataatca 


gcattgctac 




tccaactatt 


aaaagtaatt 


tatatttata 




gagaagtctc 


aaagggctca 


ttgttataac 


20 


tqtaaaatqa 


taatactgta 


tgcaaagtta 




gttacactta 


cattactcag 


taaattggat 


25 


catctcgggt 


gttcccactg 


accattagaa 




taaaatttgc 


cattttccac 


aactcttact 




cttgcaatac 


tctatggaaa 


ggatttcaag 


30 


tgaattaaac 


cagaaggaag 


gaaataaaga 




aatagatgca 


ttttatctta 


atacttgact 


35 


tattatgaat 


cttcatcatt 


aaaatgtatg 




agcatatgtc 


tcatattatt 


tttcttcatt 




tatttcatgt 


acattttgga 


attacttcgt 


40 


qqattttqat 


taagaaggta 


ttgagtctat 




aatattgagt 


tttctaatcc 


attaacagaa 


45 


ttatttattt 


atattttttt 


gagatggagt 




ggtgtaatct 


tggcttacta 


caacccctac 




cagcctccag 


gtacctggga 


ttacaggcac 


50 


tcttttagta 


qaqatqqqqt 

3 ""*3 " w j 333 *- 


ttcaccatgt 




cacagcttag 


tgttagaaaa 


caaatttaga 


55 


ctctgcatag 


ctatcccctt 


tatggttctc 




tcaaattctg 


gtttttgtct 


tctgaatttg 




catggtctgg 


aaagtttctt 


catgcagaaa 


60 


ttcatttttt 


cagtgttgaa 


agtccggcac 
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acttctaact 


ataagtagtc 


caagctcttc 


47460 


tgattttttt 


aaaaaatttg 


aatccagtat 


47520 


tgaggcaaaa 


tgaaggcttc 


acgctaagtt 


47580 


tttctcctct 


aaaatctaat 


ttc tcttctc 


47640 


aaagtaagga 


taattaaaga 


ttaaagaata 


47700 


taaagaaacg 


ggcgatcaat 


aataagcaac 


47760 


aaaaactgct 


atctattgag 


aaaacatgca 


4782 0 


aaatcaaatt 


aaacccactc 


gtaaatgtaa 


47 880 


agttataata 


ggtattttta 


catttgaggt 


47 94 0 


gaggcaaatc 


aacagttcta 


acagtgatgt 


48000 


tatagagcat 


ggaacaattt 


ggggaagaat 


4 8 060 


atctgtagtt 


tatcttaaat 


taactatttt 


4812 0 


tttcctgatt 


ctgagtcctg 


atcccataaa 


4818 0 


agagacatac 


ttttatcttt 


aataaaaaca 


48240 


gttgaagccc 


actgctgtgg 


ctgctgaagt 


4 8300 


agaaatttcc 


tgtatcttca 


ttttgtatct 


48360 


atagctttat 


ggtaggtctt 


gaaatcaggt 


4842 0 


aacaaaattg 


« J» 4— 4— mm -mm X— — mm* mm* 

atttagtcaa 


agtctttaat 


A fi A o r\ 

4848 0 


caactttatg 


caatccaaaa 


agaccttctt 


48540 


aggtctataa 


gggaaaattg 


ccaacttaac 


48600 


i i _ , j i i j_ 

tatacctttt 


cctttattta 


tttattattt 


48660 


ctcgctctgt 


cacccaggct 


gtagcgcaat 


4 872 0 


acctcctggg 


■L._. J_ .... _ L J_ _ A_ 

ttcaagtgat 


tctctcacct 


4 8780 


acgccaccac 


gctaggctaa 


ttttttgcat 


48840 


taaccaggct 


agtcttgaac 


tccgtaaatg 


48900 


atacttccct 


gcagatttct 


ggacctctgt 


48960 


ttctcactaa 


ttatagctgt 


ctcactgccc 


4902 0 


gcaatatcag 


aatggatccc 


tctccctgac 


49080 


accagagcaa 


ttattatatt 


cacatttgtt 


49140 


tgtttcccaa 


tgtctggaag 


cagttgtgct 


49200 
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taattttttt 


ccggttttat 


ttatgatata 


aaaaaatttt 


tgagacaagc 


tctaactcta 


49260 




acacccaggt 


ttgagtgcaa 


tggtgcaatc 


atagctcact 


gcagcctcaa 


aatactgagc 


49320 


5 


tcaagggatt 


ctccaacttc 


agcctcccaa 


agtgctggga 


tcaaaggtat 


gggccactgt 


49380 




gaccagcgca 


gttttctaac 


tgaccagagt 


ggaagattaa 


gtctggttct 


atcataattg 


49440 


10 


gaagagctga 


gtttcaactt 


taaaaagatg 


atcataggct 


gggcatggtg 


gctcacgcct 


49500 


gtaattccag 


cactttggga 


agtcgagggg 


ggcggatcac 


ttcaggtcag 


gagttcaaga 


49560 




ccagcctgac 


caacatggtg 


aaacctcgtc 


tctactaaaa 


aatacaaaac 


ttagttgagc 


49620 


15 


atggtggtgc 


gcgcctgtaa 


tcccagctac 


ttgggaggct 


tggcaggaga 


attgcttgaa 


49680 




cccgggagga 


ggaggttgta 


gcgagccaag 


attatgccac 


tgcattccag 


cctgggccac 


49740 


20 


agagtgagac 


tccatctcaa 


aaaaaaagct 


catgaacaca 


gatgttagca 


tttttatttc 


49800 


atttgtttat 


atttaaaatc 


gatttaagca 


tgtcaacata 


tgtaacttaa 


taaaattaat 


49860 




acttataatt 


taaaatattc 


aagcagaaaa 


ttgaagtatt 


gtccttttca 


tatgctgata 


49920 


25 


ttttagtcac 


cttaattggg 


tttcataaga 


agcaactatt 


aaaagacact 


agcatgttgc 


49980 




aattattttt 


tatttaaatt 


aatttgtgtt 


attttgcaat 


gttttatata 


gtaatatcca 


50040 


30 


gacagtttta 


atatacactc 


aaaagttttg 


ttttcacaca 


acataaatat 


aggaaacaca 


50100 


atgagacaaa 


gctatacaaa 


ttccagggaa 


aatatttaaa 


tctttgaatt 


tattaaaata 


50160 




tgtctctcag 


tttgaatttt 


gagtttactg 


taggaaaaca 


cagaaatgat 


gtgtaaagta 


50220 


35 


acttcctttt 


gattgcctgg 


tggatagaaa 


aacaaaaaac 


aactaaattt 


gagatatggt 


50280 




ttgaacagag 


ctgatgatgc 


atttgttttc 


ctgtttgaat 


ctcctcaaga 


tcagcaattt 


50340 


40 


gtttttttac 


tgactaaggt 


ttgttcagat 


tgctactgct 


catatgattc 


ccaggaactt 


50400 


aagtactttc 


gagtaactgt 


gtgaggctga 


taacactaga 


tttaaaaaaa 


atttgtcttc 


50460 




aataagtaca 


ctctttccag 


cagaggaagc 


agttcagaaa 


gttctttgtg 


gtttcccctc 


50520 


45 


tggatctttc 


tatcatcaat 


ttcagcaaaa 


ggtggtattt 


ttcttttaat 


aaaaggtcaa 


50580 




gaccgcttgt 


tttaatcctt 


ctcagactcc 


aggtacagct 


taaccagata 


tagcctgtaa 


50640 


50 


cttgctccca 


tctcttgaat 


gcatgctttc 


agcttggcca 


ggaggggctg 


gacatcaaaa 


50700 


ggaggagagt 


cacagtctgg 


atggggctca 


ggaacctcac 


agtgctgtat 


aagccagcta 


50760 




tcatctaggg 


gggttctttt 


ccccttaagc 


cagcagcaaa 


atgcttcaac 


ccttgtcaat 


50820 


55 


tactttcagt 


aactgattct 


tttgtcttgt 


tttttttcta 


cttagtattc 


tctcccatat 


50880 




ttacraaaata 


ctgaaagcat 


tcccgtttta 


cctctaactc 


tgaagccaca 


gaaactgtgc 


50940 


60 


ccataacagg 


gttgctaaat 


ggcctaaatg 


acatatatgc 


aaatatatat 


acacaaacac 


51000 


acatgtgtat 


ggaatatata 


tgtttgtata 


ttatttgaaa 


atatatttaa 


ataacttgca 


51060 
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tattacatat aaatttatat atatttcata tccatatatg catatacttt ttctcataca 51120 

ttataaaact ctggcaacat ctacatgtaa catatgagtc tttatttgtt ttgctgacaa 51180 

5 

agattatctt tcccagaatc tggattagaa aatttagtta tttagtattt ggttgttatt 51240 

atttcaatat tttgatttaa tttattttgg aacagaaatt agcctgcaaa ttctgcaaat 51300 

10 ttttcttaga aatgtatcat atgcataaat ttgttttagc tgacctagaa aacggagggc 51360 

ttatatttcc ctttttattt ttctatataa aagatatttg atttaagaag aaaatcagtt 51420 

cgaagtcttt gtaaatctga tatattctct tttcccaatt gtattgccca agtcagagtc 51480 

15 

tattattttc cacaggcaga tatttgaaag gacttaacaa ataccacaga aggtactcac 51540 

ataaggcttt tatctaaagg cagagagtac aatgcagcaa gagagagagg gcaagcactg 51600 

20 gagacattcc aagcagtaca agacattcca gatagtacaa gctccccata gacctaacca 51660 

gacacatgaa ccatgccagt ttctgacttg aactgcccag atctatgcaa gaaactctct 51720 

atatcccatt aagtagtagt caagcttgtc aaactcatta ggctagttgc aagtcattag 51780 

25 

caaagaaact gaccttgggg cttgccaaga gagtttaaaa ttattctaaa ctgcatatta 5184 0 

taaattctaa tggtcttatc ttgaccaaga gccaaaattc atacattcag gcatccccag 51900 

30 gattaagaca gagctcttaa gtacctctgt aannnnnnnn nnnnnnnnnn nnnnnnnnn 51959 

<210> 77 

35 <211> 84248 

<212> DNA 

<213> homo sapiens 

40 

<220> 

45 <2 21> MS4A4E_initial_coding_region 

<222> (18861) . . (19004) 

50 

<220> 

<221> MS4A6A_initial_coding_region 

55 <222> (67182) . . (67328) 



<220> 

60 
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<221> MS4A6A_coding__region 

<222> (68944) . . (69078) 

<220> 

<221> MS4A6A_coding_region 

<222> (70593) . . (70649) 

<220> 

<2 21> MS4A6A_coding_region 

<222> (73298) . . (73507) 

<220> 

<221> MS4A6A_coding_region 

<222> (75780) . . (75881) 

<220> 

<221> MS4A6A_coding_region 

<222> (76656) . . (76748) 

<220> 

<221> MS4A4E_coding_region 

<222> (42168) . . (42236) 

<220> 

<221> MS4A4E_coding__region 

<222> (40220) . . (40321) 

<220> 

<221> MS4A4 E_c o d i ng__r e g i on 

<222> (35782) . . (35940) 
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<220> 

<221> MS4A4E__coding_region 

<222> (34287) . . (34343) 

<220> 

<221> MS4A4E_coding_region 

<222> (23822) . . (23950) 

<220> 

<221> genomic_DNA 

<222> (1) . . (84248) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 



<400> 77 
nggataatat 


taggctaaat 


acctaatgta 


gatgacaggc 


tgataggtgc 


agcaaaccat 


60 


catggcacgt 


gtatacctat 


gtaacatacc 


tgcacttctt- 


cacatgtatc 


ccagaactta 


120 


aagtataaaa 


aaattaagaa 


atttttaaat 


aagaaacctg 


aaatgtagat 


attctatagg 


180 


aaaagtgaca 


tggtttcttt 


aatagataaa 


tgacatgaat 


aaaaaagagt 


gggaagggga 


240 


aaatatagat 


tgaaggagac 


ttaattaaca 


tatcaacaaa 


atgcaatgtc 


tggcctttgg 


300 


atggatgggg 


ctggggtgtt 


cttgacttaa 


gaaaatgaac 


aaatatgtat 


gagaaaataa 


360 


taaaatttga 


atatgaactc 


tatattcgat 


gatactaagg 


aattattgat 


aatgctgtaa 


420 


aatgtgataa 


tgatattaga 


aaatgttatg 


gaataagtgt 


catctgttaa 


tgatccatac 


480 


caaggtataa 


actagtgagg 


tgatattaaa 


tgtgagattt 


actttaaaat 


actcttacaa 


540 


atatgtgcag 


ggtggagaga 


gatgagaaaa 


cattggcaaa 


ctatcgataa 


ttgttgaagc 


600 


agactggtgg 


atgtatggta 


attcattata 


ctaacctccc 


tacttttgtg 


tgtgttttaa 


660 


attttccaga 


agaaaaagta 


aaagacagaa 


agtggtgatg 


gtttcatgag 


tgtatattta 


720 


tctaccaaat 


gatcaagttg 


tatacattaa 


atatatacag 


ctttttgtat 


gccaattata 


780 


cctcaataaa 


gtggattttt 


aagacagata 


agaaataaaa 


ataagataga 


taaaaactac 


840 


acgtgattgt 


gaaattaatg 


aaatgagtgt 


ttttagctcc 


atcagttcag 


tttggttctt 


900 


tcttaaaata 


gcaatttcat 


gttttatcta 


ctgtactatt 


ttattgtatt 


ccttagattc 


960 


attggattag 


gtgtagactt 


tctgctgttg 


atgatctttg 


ttcctatcca 


tgttctgaat 


1020 
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tctacgtctg acatttcagc ctttcagcct 
agtaatttgg agataagaag agactctagc 

5 

cagtcatctg tttgtgctga tgttccttca 
tttttttttt cttttattgc ctttgatgcc 
10 tttagttgat tggctttgat tctagtcagt 

gtattggatg aggcttccct gattactgtc 
ctggtccaca gaactccctt aggcaggggc 

15 

gcttagcgct aatctgctgt ctgaatgctt 
tggagtttag gtagaagcag gaccactggg 
20 caaagattgg tggagttgcc caccctatca 

cacttcttgg taatttcagg cagaaagaag 
tgtggctcac ctgtctacca gtggtaggag 

25 

tgcttcccag ggaaacagga ggctgcacct 
ctaggccaga actctagcag gcattgccca 
30 tcacctgccc tgctgtcagg gtgttttctg 

cctaaagact gcaatttctg gctgagttca 
ctagcaagca ttgcctgccc ggctaccagt 

35 

tgggtgcttt ctgggacaag agtaggctat 
aatgctgggt aggaaggtgg cacaaggaat 
40 acagtcccta atcaagagag gcaagagcta 

gactgcaagt gtcttttctt ggtagaaggc 
tgtctggtag tttgtcattg ataccttctg 

45 " " - - 

ttctaatcct acacaagcat tttttaactc 
atcatcttat ttttatcaag aatattgctt 
50 gtattctgga tatatatcct ttcttaatta 
gggttgtcaa tttttgctat gtatatttag 
aaatgattac atctttctga agattgtact 

55 

tcatcagtaa catattttgc tctaaggtca 
ggtttgctcg atatatcatt tctcatcctt 
60 ttacttcttt ttaacagtaa ttagttcagt 



ggttaaaaac cattgccgga gaactagcgg 1080 

tttttgagtt gccagttctt gcacagctct 114 0 

atctttgaag ctgctatctt ttagatgggc 1200 

cttgggggtt tgattgtggt ataagttggg 12 60 

agactggctt tgatcctagt cagttcctgg 1320 

tctctgctca cat t tat tat gttgggcatt 13 8 0 

cacagttggc ggacaagtag tatccttaca 1440 

tccagggtaa cacagtgttg tccccaccta 1500 

ctataagctg gggtgtctgg ttacaaaagg 1560 

tctgggtgtt tctggggcaa gagaatgctg 162 0 

tagcactgct gagctggaag ctccagcaga 1680 

tgggtagggc tgtcctctct ggcatctgga 1740 

gctggctcag ttcacagaga agtaggacca 1800 

ccaggctacc agtggcaggg ttgggttgca 1860 

tgattacagg aggctgtgcc ctctggcaca 1920 

catataagaa ggactactgg tctggaagct 1980 

gtcgggggtg cattgcttgt ccttcttgtc 2 04 0 

ggctactggc caagtccagg cagaagtgga 210 0 

attatgtgga gattatttca gaaccctaca 2160 

aaggaatcag gaaccctagg agggctatgg 2220 

ccaacagtta gctcaattgt ctgtgttaaa 2280 

ttacataccc tctatgctcc atcaatcgta 2340 

tgtatttctc ccacctcctc aagccaagcc 2400 

acagcctctg aatgttccct ggttttttat 2460 

catgtattaa aatattatct aatattctgt 2520 

aggttctgtt attaaatata tacacactta 2580 

atttttataa ttaaaaatta atccctttgc 2640 

attttctctg atcagttttt tttgggttaa 2700 
gtactttaca gtttttgtat gcattaaaat ' 2760 

tttgaccttt taaaatttta tctgaaatat 2820 
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ttgtatttta 


ttacatatga 


cgtatttatt 


tatatatttg 


ggattaaatc 


taccatcaca 


2880 


ctattgactt 


tcctggagga 


aataaatgtt 


ctaaatgtta 


ttagaatttg 


ataaatcaaa 


2940 


tacacatgct 


ttaattttta 


ggattacaag 


aacacacaat 


gaaagacttt 


ataagtacta 


3000 


aaattaatta 


gagggagaaa 


taatgcaata 


aaggggtata 


tagaaccttc 


aatcaaaaaa 


3060 


atagaaaaca 


tttttcgttt 


gttcatttgt 


ttgttgtatt 


tggtttctct 


gttttacgat 


3120 


attgattttc 


ttaaatattc 


agtgttcatt 


tacactttca 


tttggtaatt 


tctcctatca 


3180 


tgtcagggaa 


tgtaggtctt 


gcttaccata 


gcttacatcg 


ggtgactgtg 


agagaagagg 


3240 


gggcataaag 


ccaggttata 


catacagatt 


99 tt 9 t 9 at 9 


tagttgagag 


aaagcccctt 


3300 


ccttctccag 


actaaggtga 


cctctgtgtc 


ttcatcatca 


ggatcccagc 


tctaaagagt 


3360 


gagcgcagga 


ctttcatttg 


ggaatcaatt 


ccataatatt 


ctctgctttt 


tgcttttcct 


3420 


ttataccttg 


tgaatggaag 


cttagacctt 


aattcttgtc 


tctttcctgt 


tttttactca 


3480 


gagctggcac 


tcacattcta 


tatacttgtc 


tagagcagaa 


ccctaaaaca 


gttttttggt 


3540 


ttggtttttt 


gctcataaaa 


cagttctcat 


attttggttt 


ggggataaac 


taccttgtta 


3600 


ggtgatgcct 


gtgtgaggga 


cagaggaaca 


actgaccccg 


cttttccaat 


ataattttaa 


3660 


aaattaatta 


acaactgctc 


tgggctctga 


cctactctgc 


tgcttgttgt 


tttgacactg 


3720 


aacttgagct 


cttttttatt 


tgcttttatt 


tgtatagtag 


ttagttttgt 


cctatgctta 


3780 


tttagaggta 


gggtgattct 


atttatgttt 


ctctgtcaaa 


tttcaatttt 


attgttcttt 


3840 


atatcaccat 


aaatcctcaa 


tcacttgtct 


gctaagaata 


acactaataa 


tatctcactt 


3900 


cacactatat 


ggaagaggag 


gtaagattga 


catctgttct 


tagtctagaa 


tttcaacctc 


3960 


tcttataaag 


tcatgaaact 


ctgtaccttt 


gtcttctgta 


atgccacatt 


ttcctagttt 


4020 


gcctgttatc 


ttctttaatc 


tctttttgcc 


atcctctgga 


aacccttatt 


tctgataata 


4080 


ctgtttctat 


tctctagcaa 


aaagggtctg 


attttttctg 


cctcctccca 


caagcctggg 


4140 


taacctcatc 


cctctgggtg 


aagagctgtt 


gggtttacct 


ctacagccat 


gttctcattc 


4200 


cctagctcct 


gccagttatc 


caacttttgt 


gaaacttatt 


tgaaggtgcc 


cacacaattc 


4260 


aaattaaata 


tgctccaagt 


cttactcttt 


gttcctgcag 


ttctgtcttg 


gtccatgtct 


4320 


attgccttac 


tgaatggctc 


tatcattcat 


tcagtttcct 


aacccagaat 


ctgagcgcca 


4380 


tcctttcctg 


ttttctctcc 


tgatttacac 


atccattata 


tcaacaagta 


ctgtcatttc 


4440 


tcgttcctaa 


acatctttca 


aatccattca 


ttctctccaa 


cttcactagc 


atcataggta 


4500 


gttcagtaac 


catcatccct 


tccctagaga 


actgcctctt 


tcttaaccct 


aactccctga 


4560 


tttcagtatt 


atttattcct 


attgttgctt 


tacatgaaaa 


ccaggacaat 


atttctgaaa 


4620 


tataacccat 


ttaattctga 


gaattaaaat 


aatttcagat 


tatagacaac 


aaagtttttg 


4680 
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ggattaggca aaactggatt tcaatctcat ctataccagc tatatgaata tctctgagtt 4740 

tcagttccat cttctggaaa atgggaataa taatgttacc tagctccatc cttcttccat 4800 

5 

ttgtgtgtct tttgcttatt catcaggtct ccaattagat gtccctttat ctaggaagtc 4860 

tcctctgcct acccaacgtt gagttctttg atcctcctat gagctctctc aggatatcca 4920 

10 attttacctt gcagatcact gattataatg ggttgtaatg atttatttat tcatcagcat 4980 

tgccacttta ccttgcaggt aactgattat aatgggtgat ttatttactt gtctatattg 5040 

tcaagtagcc tataaaatcc ttgagcgaaa atcatttttc accattttat ctgaagcaca 5100 

15 

tggcacactg cttggcttat agtagctttt ctctaagaaa atgctgagtg aatgaaaatt 5160 

gcatgccact ttgctttcct ccatgtaaag aaggcattat tttattttac caatgaatca 5220 

20 ggttcagggg attaacaagg tcatacaaga aatgattgcc attgggtttg tcttagtcaa 5280 

agaccagctt gttctatgat accagctgct tctttgactt agaaaggtgt tgtattaatt 5340 

attgctgtac aaccaacttg ttcccatttt atggctattc atgttgaacc tccaaaggtg 5400 

25 

aaatatcatg aaaataagat atttaagttg tgtatattaa ggcacaatta ctttttcacc 5460 

aacctaatac aatagtaaca gtagttctct gggaagaaga gtactgtgga aatgtgtgaa 552 0 

30 attgtgaaag ggggcattta taaattacca tttacttagt tgtgtatata acaatctttg 5580 

agcagagtct ggcaaacaga aaatcctcca ccactgatac ccacagtaat tagcactatt 5640 

attttcatag gaaagactgt atcaggcagc cactgctttt tcagcaaatt tcaaaataaa 5700 

35 

cgtccacacc catattggga acgcaaaatc agttaatgct tccgataaaa aagaggaact 5760 

agttgtgtgt ctcagaagtt ctcaacacat cctttaaggt tccaaacatc tactagaaaa 5820 

40 ggagtgcaga tttaaactga gtgaggtgtg gagtggggga agctgactgt gtctggaaca 5880 

aagaactttg aggaacttgc ccagagccct gcactcatca gacctgcagc agacattgca 5940 

ggcctgaaga aaggtaggtc cagggacttg ccttcctagg ctctcggact gatagtttgt 6000 

45 

ttcatttcct gggtttagga tttctgggag tggagtgaga aaaaaaaatg aggtcacttc 6060 

ccatgaccca agctggtagt cagggttggg gtggattgag tgcacattta aatatgcatt 612 0 

50 tggtgagggt tgtgggtggt tgaggaacag tcttcgctac agttgatggg agttagaaaa 618 0 

gcctcctaaa ggctgacact ggaggctgga tgaagttgtt ttagttactt ttgttccgta 6240 

agttctcagg ataccaaaac actgtaagca acattttaaa ctttgattaa ttacctcatc 6300 

55 

tactgttaat gcacttcttc catggtcttt attctttgtc tctgactggt tatttttata 6360 

attcaagact taagaaaaga tctagcacgt gcagaatttg gtgtacatgg agtagggcac 642 0 

60 aggcagggtc cccagccttg tacatttaga tgggctgtgg cctgacgtag ttctccagtg 6480 
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gtctttggca ctattcatgt ggcacccagg 
tgcaaatgaa aaacatcttg tagactttta 
5 gtagtcgtag tcattgtaat ctttccccag 

taatgtacct agaatcacac aagttaacaa 
tggttcctag cccaggcctt tccccctttt 

10 

atcatcctcc taaaaaccat ccagattcag 
aactctgcca agaaaaaggg gatgaccgtg 
15 gtgacccata agggggatgg tgacataaaa 

gatccacaag gagactgggg ccaatagaag 
tttatccggg gtggggagga ggggcgggga 

20 

tataggtgat gtggcaaaac caaggtttag 
ccctcatatg tatccaatat tctctgcagt 
25 tggagtcttg taaacacacg ctttctttcc 
cacatggatt tctgtggtgt aaagatagga 
cttataaaga cagagcacca gccttgaatg 

30 

tatggaaaat cataagagtc aattataagc 
gaagcttaca gccttacccg agaggcacta 
35 gataagacag acccaacatg tcctaaattc 
gccgagtttc ccttacatta aagaataggc 
gcactttggg aggcccaggt gggtggatca 

40 

aacacggtga aatcccttct ctactaaaaa 
gcgccggtag ccccagctac tctggaggct 
45 cggagcttgc agtgagccga gattgcacca 

ctccgcctca aaataaataa ataagtaaat 
tagatggact aagaagaatt atcttcagtt 

50 

taaaaggaaa ataacataga tttcttaaat 
acatgcggga aatgggggct gtaggagaat 
55 aactgttagt gtgtgagaag attagatgct 

ggtcatggcc gggggagtgg aacattcctg 
atataggaat gaaattactt tagaacaact 

60 

aaaataaaaa ggagaaattc aaggtaaggc 
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cagtaaactg 


aggctgctca 


tgataaaatc 


6540 


gcattcttag 


gataagaaag 


aagcttgtaa 


6600 


atgaagtaaa 


tgaaatccag 


aaaagggaaa 


6660 


aagggctagg 


agcaaaaaac 


ccagaattct 


6720 


cccttcccat 


gccaccattt 


ctgcatcccc 


6780 


gcctcatttt 


gggtttctag 


tctatttgat 


6840 


tataccactc 


tcctcatgct 


ctcttgtgtt 


6900 


atgaggaaac 


aagagactgc 


tggtgctgct 


6960 


agggaagggt 


ggtctaggtt 


accccacctc 


7020 


agacattgta 


gttgtcaaga 


acctgatgca 


7080 


attcttctcc 


aacctgaaag 


aaatgtccct 


7140 


taaaaaatat 


atattttgat 


tgtagagaag 


7200 


ttggggacct 


tatctaccct 


agataattaa 


7260 


ggcagttata 


taaatttatg 


tcaagaatgt 


7320 


accagaacct 


gacctatgtt 


accaaagaat 


7380 


cttaatttga 


tataatggct 


cggccctcag 


7440 


aacacagagg 


gagaactacc 


aagtgtgttt 


7500 


tcagtctgaa 


ggtgtgcgtg 


agagattgag 


7560 


cgggtgcggt 


ggctcacgcc 


tgtaattcca 


7620 


cgaggtcaga 


ggatcgagac 


catcctggct 


7680 


tacaaaaaaa 


ttagccaggc 


gtggtggcag 


7740 


gaggcaggag 


aatggcgtga 


acccgggagg 


7800 


ctgcacgcca 


gcctgggcaa 


cagagcgaga 


7860 


aaataaataa 


ataaataaaa 


agtacacatt 


7920 


tctttcaaaa 


taactttggt 


ttctgtggtg 


7980 


ccttagttac 


cactagaggt 


tgagtaatcc 


8040 


ggtgtctgag 


agaagacaga 


aactatgcaa 


8100 


gtggtgggtt 


tgttgctatt 


cttgtaaggt 


8160 


caaacgggct 


caacatatgc 


aaatgtctca 


8220 


taaataagtc 


caatatactt 


ggggctttaa 


8280 


tacgataata 


gatatgggtt 


gaagtagcag 


8340 
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caatatcaac aacattaaat ttttatcttt 
actttaaagg ggagggtgac acagttgtct 

5 

gggtgggttg aacatccaag gctacaaatg 
aatactcctg ataaaaatag gatgatggct 
10 taaaaaatgg ttgcgtagaa ttagaatatg 

aggtatcaat gacaaactac cagacattta 
tctaccaaga aaggacattt gcagtcgcat 

15 

tcttgcctct cttgattagg gactgcttgt 
ccttgtgcca ccttccttta tctgaaagat 
20 tttggtcttc taatgtttga aataaaaatt 
aatgatcaaa ttatgggctt tattatagtg 
tctctcaagt tcagcaaaat ttgagggtat 

25 

gctattttat tttattcgtt tgctccaaat 
gaagtctact tataggtttg attataccta 
30 taaactccat tatcattaga acagagatta 
ctctacatag tcttgtatct acctagtcca 
ttctgaccaa tctttgagac aaactgcaaa 

35 

gtaccatgaa taacatgctc tcaccagtta 
cccccaagac aggacaggtt ggagcagggt 
40 gtccttgagt ggtaggaagt gttagaggaa 

agtaagtgat agtaggaaga ccaataataa 
gacatgaact ggttgaaatg tttacgtgtg 

45 

atgctgtatt attcacaggg tagagttaca 
agtggcaggg ccaaaattta aatccgggta 
50 gatccactac actagtctgg cgaatagaag 

tcagttgttg gacatttggg ttgtttgcac 
aaaattcatt ttacaagttt ttgtgtggac 

55 

gggaatggaa gggataggtc gtaaggctgg 
tagggaggaa tcctccttat gttttaggaa 
60 tattagctct tctgtggtat aaacacttct 
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atactaaagg 


tggtgaaatg 


caaagaaagg 


8400 


ttgcatttga 


tcatttggct 


acaaagtcag 


8460 


gacaccgggg 


aacattcaga 


ggctgtaagc 


8520 


tggcacgcgg 


agatgggaga 


aatacagagt 


8580 


attgatggag 


catagaaggt 


atgtaacgga 


8640 


acacagacaa 


ctgagctaat 


tgttgggcct 


8700 


agccctctta 


gcattcctgt 


tacctttagc 


8760 


agtgttctga 


agtaatctcc 


acataatatt 


882 0 


aaattttgag 


gaagactgaa 


tatcttctaa 


8880 


acctcttctc 


tttgctggaa 


caatttgata 


8940 


acttcaaaat 


taaccccaga 


gggggacaat 


9000 


ggattcagtt 


taagaccaaa 


aataaggcct 


9060 


ctaaattttg 


tattctgttc 


aggggatgct 


9120 


ctcaataaaa 


tgtaaatgta 


agtcacctca 


9180 


caagaggaag 


aggagggcct 


cttggcccac 


9240 


tgagagcaaa 


aacaagcaca 


accctgactc 


9300 


agagaggaag 


agtcaacctc 


tgacaacacg 


9360 


tatagacccc 


tcctttctgc 


tttttcactt 


9420 


catgccagga 


ataaagcccc 


tctttagcat 


9480 


atttatgttt 


tatacttcaa 


gccagaaatg 


9540 


tagtcaatag 


gtagttagta 


tttattaagt 


9600 


ctatttcatc 


attaaatttc 


tctacgagat 


9660 


gaaacacaag 


attaattaac 


ttgagtaagg 


9720 


atttaattcc 


agaactcgtt 


tgcttaagag 


9780 


aactctttta 


tttgcacatt 


tattcattcc 


9840 


cttttgacta 


ttataggtaa 


tactgctagg 


9900 


atatgttttt 


atttctcttt 


gtatgtatct 


9960 


ttcatggttg 


acattttgca 


aagaatcagt 


10020 


tagtttcagt 


aggattagct 


ctttggtatt 


10080 


ttgtacatgt 


gttagaattt 


ggatgtgaat 


10140 
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ctatagggct cagtaatcta atcttccacc ttaataaact ggaaaaagaa gaacaaacta 10200 

aatggaaaca acttgtagaa aagaaataat aaagactaga gtgaaaataa tagagagtag 10260 

aaacacatta gagtaaatca acaaaaccca aagttcattc ttgcaacgat caacaaattt 10320 

gaaaaacctt tagctatatt gaatgacaaa aattacaaga agaaatcact aaaatcagga 10380 

atgaaagagg cagcataatt accaaactta cagaattaaa gaaattataa gggatactat 10440 

aaaattatat ccaaataaat cagataacat atgaaatggt caaattttta tgaagatact 10500 

actaaaactg actgaagaag aaatacaaaa ataagaagaa atacaaaatc tgaatagatc 10560 

15 tgtaataaga gattaaatga ggaattttaa aaacttttat aaaggaaagc ccaggctcct 10620 

gtgatttcac tggtgaattt tactaaacat tttaaaaaga attaataccc ttccatcaaa 10680 

aactagaaga gaagggaaca attccaaact tattacataa ggctagtatt attttgacac 10740 

20 

caaaatcaag aaaagacaac ataagcgaac aaaactacat atgaatgtct ctcatgaata 10800 

cagacacaaa tgctaaaaaa aaattagcaa gcagaattcg gcaacatgta aaaaagatta 10860 

25 tatatgagga caaagtcaga ttaatctcaa gaatataaaa tttagcattc aaaaaatcag 10920 

tgttacataa tgcattaata gaataaaaga caaaaaccgc aaaaagccac cacgaatgcc 10980 

tcaatagatt aaaaaattaa caaaatccaa tactttttat gataaaaata ctcatccaat 11040 

30 

gacgaataga atggaatcac tcccccacca aataaaaagc tattagaatt aatgaatgag 11100 

ttaagcaagc tcgaaggata caaataatat acacaaatga acagacactt caaaataaga 11160 

35 tgtacatgcg gccaacagtc atatgaaaaa atactcaaca tcactgctca ttagagaaat 11220 

acaagtgaaa accataatga aatgccccct tacaccaatc agaatgacta ttaacaaaaa 11280 

gtcaaaaaaa taaaaataac agatactaac gaggttatgg agaaaaagaa atattaatac 11340 

40 

actgttggtg tgagtgtaaa ttagtttaac cactgtggaa cactgtggtg attcctcaaa 11400 

aacctaaaaa cagaaatacc attcaaccca tcaatcccac cgtgggtata tacccaaagg 11460 

45 aatagaaatc attctatttt aaagacacat gcacacatat gttcattgct gcactattca 11520 

cagtagcaaa gacatggaat caacttaaat gtccatcaat agtaggctgg ataaagaaaa 11580 

tgtggtatat atacaccatg aaatgctatg cagccataaa aaaagaacaa gatcatgttc 11640 

50 

attgcaggaa catggataga gctggaggcc attatcctta gtaaactaac acaggaacag 11700 

aaaaccaaat actgcatgtt ctcacttata agtggagcta gatgatgaga acacatgggc 11760 

55 acataaaggg gaacaacaca cactggggct tattagaggg tggaggctgg gaaaagggag 11820 

aggagcagaa aaaataaatt ataggtacta ggcttaaaac atgggtgata tggttgggcg 11880 

cagtggctca tgcctataat cccagaactt tggaaggctg aggtggatgg atcacctgag 11940 

gtcaggagtt caggaccagc ctggccaaca tggtgaaacc catctctact aaaaatacaa 12000 



60 
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aaattagccg 


ggcgtggtgg 


tgcatgcctg 


5 


gagaattgct 


tgaaccctgg 


aggcggaggt 


ccagcctggg 


tgacagagca 


agactcggtt 




gataaaataa 


tctgtaaatc 


aaaccctatg 


10 


rr1~£> r~" c r* i~ n 
y UaL<L>OL uy a 


CLM ' — Ldcicia i_ ct 


aaaattaaat 




agttctatat 


gctagcactg 


aataatttaa 


15 


aatagtttca 


aatagaacaa 


aatttatcaa 




acaaaatgtt 


aaaagaaatt 


aaagaaaatt 




aactggaagg 


cttaatattg 


gtaagatggt 


20 




a tdaacicia L, l. 






atcatatgga 


aatacaaggg 


actcagaata 


25 


ttggtgcact 


tacactttcc 


aattcaaaac 




tggtattggc 


ataaggacag 


acatatagaa 




aaaataaaaa 


tgggatggaa 


tagaatcatc 


30 


LyaL u u l uy a 


ci ct y y ci i_ ci cv 


araari"apt*a 

CL W d CI V-> *— C*. \~< CL 




catgtttgaa 


caacttcaga 


ttcacatgca 


35 


aaacaaaaca 


gctcaaagtg 


gagcaaaaac 




ccttctattt 


aggtatattg 


agacctatat 




aaattttaga 


tatagagcaa 


agagataaaa 


40 


Cl U L LCICI L.CIL'O. 


tarrt"trcaa 

^ ^ 1 — > — k — • d. C*. 


ttttattaaa 

U U U U I— t*. t^L t*. 




agtcattaca 


aaaaagccca 


tagtttgatc 


45 


tttatttcaa 


acatttgaat 


acctaaatag 




ttcaaaaaag 


taaggtggtg 


caagggacag 




tagtcgctaa 


tcaagagtgg 


caagatctaa 


50 




L» V- L- l_» 1— ^ 1 — 


att*aaaaarr 




atctggaggt 


ttaagatcag 


tcactgtggc 


55 


ctgaggtaag 


aagattgctt 


gaccctgtga 




actctgtctc 


tacaaaaagc 


acaaaaatta 




cagctacttt 


ggaggctgag 


gtcaaggctg 


60 


agcttgtgtg 


atagagtgag 


accctgtctc 
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tagtcccagg 


tattcgggag 


gctgaggcag 


12060 


tgtagtgagc 


tgagatcaca 


ccactgcact 


12120 


tcaaaattta 


aaaaaaaaaa 


aaacctggat 


12180 


acacaatacc 


tgtatgacca 


acctgcacat 


12240 


taaatatata 


tacatacaaa 


aataaattgg 


12300 


aaatgaaatt 


aagacaacaa 


tctcctttac 


12360 


aataagcaca 


agacttgtac 


atcgaatact 


12420 


taaacaaatg 


aaaaaaaact 


catattcagg 


12480 


aatattattc 


aaattgacct 


acagattcaa 


12540 


tatttcagaa 


atttgcaagc 


taattccaaa 


12600 


gtccaaacaa 


ttttgaaaga 


gaaaaatagt 


12660 


ttactacgaa 


gctataattt 


taagacagtg 


12720 


caatggaata 


gaattgaata 


aaattgtcta 


12780 


tagaaataaa 


cctctatatt 


tatgactagt 


12840 


agtgagaaaa 


gaatagtctg 


tttagcaaat 


12900 


aaagaatgaa 


gttggacact 


ttcttacacc 


12960 


caaaatgtaa 


gacctaaatg 


taactaatac 


13020 


gtagacttta 


gcatcctgtg 


agcagaacac 


13080 


taaaatagca 


ggccttattt 


ttggtcttaa 


13140 


cttgggagaa 


ttgtcctcct 


ctgggcttaa 


13200 


atttaccaac 


ttgtcccgga 


agggagacaa 


13260 


aagatatcca 


gtcttcctca 


aaatttctct 


13320 


tatgtggaga 


ttatttcaca 


accctacaag 


13380 


agggaacagg 


aaccctaaga 


aggctatgaa 


13440 


caacaattaa 


ttactttgtc 


cgtgttaaaa 


13500 


tcacacctat 


aatccaagca 


ctatgggagg 


13560 


atttgagaac 


agcctgggca 


acatggcaaa 


13620 


gctgggtgtg 


ctggtgtgca 


cctgtaatcc 


13680 


cagtgagtca 


tgataacgcc 


actgcactcc 


13740 


aaaaacttct 


ggtagtttgt 


cattaatgcc 


13800 
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ttctgttacc tacactgtat gctcaattgc 
tgacagtttt tgattagaag tttattttat 
5 tgactactat ttgcatggag tatttcatcc 

gctaaagtga gactcttata gcatatagat 
tgattgaaga atttaattcc tttatattta 

10 

gtgccatttt gttatttggg tggggggggg 
atctcttgtc tccctttgtg gttaggtact 
15 tatctttcat gtatctgctg cagttctaca 

aagggcatgt atgacataaa cctctggtga 
gtgcacaaag caaagagaga gagatggggc 

20 

tgccatagct taccctctcc catgataatg 
ctgggtgtaa ttacctctga aaggtcacac 
25 tttcacatga gttttagagg ggacagtgaa 
actcatgttc ttctcacata caaaatacaa 
tacccattct agcatcaagt ctaaagttct 

30 

tgagattcaa ggtatgactt atctggaggc 
aaaaccagtg atttgtttcc aaaatataat 
35 gttcaaaaga gagagccagg caagaaaaag 
caacagagaa aacagaatta tgttttaaat 
ctgaatcctg ggagtgctga ttggggccct 

40 

ggactcagtc cgtgcttcag ctctgttagg 
ctgagatctc aaaggtagcc ccactcccaa 
45 tctctgtgtt ggctctgact ccgcattccc 

aagtggctcc actcccatag ctatatgtga 
gcccctacat ctcttgcagt ctgtgagcct 

50 

ttgtgccttc caggcacagg tattccaggc 
tcagggtctt gcaaaactgc tgtggcatat 
55 gatgcaggtt acccacaaag ctaaggctgt 

ccatggagca gggatggaat tgagactgat 
tccagggaaa gaggggtgct ttggaggctt 

60 

tttggttcca gagacaatcg ggcatagcag 
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tcattaaatg 


atgaccttct 


ttgcctcttt 


13860 


ctgacctaag 


tacaggcagt 


ccttctcctt 


13920 


cttcactttc 


agcctgtgtt 


ttttcttaaa 


13980 


ctgttttttt 


ttttaaattc 


agccactctg 


14040 


aggttattat 


taataggtaa 


gtagttagta 


14100 


ttgtttttgt 


agatatttgt 


ttatttcttc 


14160 


tttatgaagt 


actaagcttt 


taattttttt 


14220 


gttcttatag 


ttctagaggc 


tgggaagtct 


14280 


gggtgatccc 


atggaaaaag 


atggaagtga 


14340 


caaatttatt 


ctttatcagg 


agtccactcc 


14400 


acatttatcc 


attcacgagg 


atgaaggtct 


14460 


ctcttaaaac 


tgttacaatg 


gcaattaagt 


14520 


atcatagcat 


ttcatccacg 


accctccaaa 


14580 


taattctgtt 


tcaatagcct 


caaaagtttt 


14640 


aagtccagag 


tcatcagaag 


cagatatgga 


14700 


aaatgtttcc 


agctgtgagc 


ctgtgaaatc 


14760 


ggtgagatag 


gtatagagta 


gactgtctca 


14820 


gcataactgg 


tttcaagtga 


gtccaaaatc 


14880 


ctggagaata 


atttcctttg 


actccatgtc 


14940 


cagacatgac 


cctgctccca 


tatctttgct 


15000 


ttggagttgc 


atgctagtgg 


ccttatagtt 


15060 


ggctccacta 


ggcatatccc 


taattgggac 


15120 


actcagcttt 


gccctaatag 


aggctcttag 


15180 


tattctttga 


tcctaggtgg 


agaaagccat 


15240 


gtacacatgg 


attccaccaa 


ggtttacagc 


15300 


atagtggcag 


tgcgagctga 


ggtcctaggc 


15360 


ggcacaggtt 


cactcaccca 


gttgtggctg 


15420 


aactctaaag 


cacacctcag 


taactcaagc 


15480 


tctggggtca 


gtgcacaaca 


tcatcaaggc 


15540 


gagctctggg 


gagcaggtta 


ctgttgaaat 


15600 


caattccaac 


cctgggggat 


ggagtgccac 


15660 
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atagtgttaa ctctagaccc tgggatggta 
ctggatgtag cagcagcaag gcctcaggaa 

5 

ggtttcttgg agcaggtgca gaaatgaccc 
gactccaggg gacaggtcag ttccggggag 
10 gtggggtgac gtagctcaat caaagctatg 

ttgagtttgg aggggcaggg tgcaccagcg 
ggagcctggc ctcagggagt agagagtaga 

15 

gcgtgtgcat agtgtagtgg tggttaagtc 
ctcctgaagc agaatgcact ctagccagtg 
20 caactcaggc catgggggcc agggcacagt 

tgtggacttc aggtagctct gtcagctgag 
cagtagtaag actgcaggtg tctatggtgg 

25 

ccttttccct gtaggaagaa ggccctcctg 
gtgatggaag aaaggtgttt tctctccttc 
30 tacaagattt ctgctactcc tttgctgttt 

gaatgtagtt gtttattcat ttttggtgtg 
tgtgtgtgtg tgtgtgttga tggcgatgag 

35 

acattctcta gaattctgaa gacatattat 
taaactttct tctattgtga ccattttctt 
40 caattttttt ccagttgact aattctattt 

caaaattttt tctgtcaaca ttattttata 
catattgtgt cactttttgc actttaactt 

45 

taaattatcc atacttttta catattctat 
gtgggacttc ttggaggctt gtatcaagca 
50 taagacttat tcagagacta attgctttct 

tactactaac tttgatagtt ctcagaacta 
tgatgcattt aatttctagg attatgatca 

55 

cttatctttg ataagtataa tctgaaatct 
gtttctacat tttcccccaa atttatcctt 
60 cctgacttat agagatttta gttgtttccc 
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gggcatggca 


atatcccagg 


ctgtgtaagg 


15720 


tgactaggcc 


tgtctgtagc 


tttttccttg 


15780 


cctggggagt 


cagggtttct 


cagaagcttg 


15840 


gtggggcact 


gcagctattt 


tgcctggatg 


15900 


ggtgcccaga 


gagtggagca 


cagtttcaac 


15960 


actaagatgg 


gggcatggag 


cagttccatg 


16020 


tagcagcagt 


ttggctcagg 


tatggcttcc 


16080 


tcagagatgg 


aggggtgcaa 


tggctactca 


16140 


attcagttcc 


aagaaggggc 


agcacagtag 


16200 


gtcagctcct 


cctctgggat 


agaacagaca 


16260 


ctcagagcct 


gtgaagactc 


tatgtgtctc 


16320 


tgatgggagc 


tgctgggatc 


ctgtcactaa 


16380 


actcagagct 


ggttctaact 


gggggaaggg 


16440 


ttcctgcagc 


cattctgggt 


ttctgtgctc 


16500 


tctggagctc 


ttccttagtt 


attttggttg 


16560 


tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgtg 


16620 


gttagtaact 


tctaattggc 


catcttgctg 


16680 


agacctttta 


attctaaact 


tctaatcttt 


16740 


gtttttctgt 


agaattcttg 


gtgatttctt 


16800 


tcagctattc 


cttgttattt 


aacacattgg 


16860 


tttgaaattt 


gtttgagaca 


atgttgcaaa 


16920 


ttcatttttg 


tgatttcatt 


ttcaattgaa 


16980 


atgtgatata 


tatccagatg 


ctcctaatct 


17040 


tatctacttc 


cttctttatg 


gaatccaatg 


17100 


aagcatgtct 


ttttgttggg 


atttctcctt 


17160 


taattctcca 


gggaagtgat 


aggtatcagc 


17220 


ataatttatg 


ctgtacacct 


caaggagact 


17280 


gtggccatag 


tgtcctgttt 


ctttgatact 


17340 


atgcaatctt 


tgtttttaac 


actcatttca 


17400 


actttcataa 


caaaatggag 


ttctaattct 


17460 
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gtttgtccaa 


attgtttttg 


gaacttttct 


agatgagata 


agaataagaa 


cattgacttt 


17520 


ttggttacat 


tcttacattt 


aagtccttat 


cttaactact 


tcaatcacat 


ttacattatt 


17580 


tatttcttat 


tattaattat 


atttatcttt 


cttttaaaaa 


tacttaactt 


cttattttat 


17640 


aatttgtaat 


tttgtacttt 


ttattttact 


gcttttcata 


attttgaatg 


tatacattac 


17700 


taaaattaat 


gctctaaatg 


attaatcttt 


tattttcatt 


ttaacttttc 


ccctctagct 


17760 


ttctctgaaa 


ttcaccctcc 


ttctcccaac 


tttaatcaat 


tttattgcta 


gatagctaat 


17820 


aatgtatttt 


cattgatcta 


tcagagataa 


atacacgatg 


acaattattt 


ccacagaaat 


17880 


ctgaattttt 


tctgtacata 


gcaaatttgg 


actacaataa 


cttttttttt 


catttatgtt 


17940 


accaataaaa 


gggcacacat 


ctccaaacct 


gacagttttt 


agcatatatg 


ctccactgaa 


18000 


atcaactatg 


acttgaagga 


tattagcaag 


cttcacttcc 


ctaaaatttt 


atggaagtgg 


18060 


gccatgaaac 


atctgaatat 


caattcaaat 


acataggtaa 


aaatttccat 


gtggctttag 


18120 


cctatctcca 


cattacttgg 


ttggaatcat 


tatactgagg 


agagacagag 


gcataaaagt 


18180 


aatatctttg 


tgcttggaat 


ttcaaaaaat 


ggctttatgt 


attcttgcta 


tgcaccagca 


18240 


agttagaatt 


taaagcactg 


taataggaaa 


agagatttaa 


atttgtggtg 


ccaagaatat 


18300 


gactcagagc 


tgtaagtttc 


attcccagag 


ttcagagctt 


atgggacaaa 


atacaaggtt 


18360 


acatgggaag 


attttcagcc 


ctgagaattt 


agctgtgctt 


tggttataaa 


aggaggaact 


18420 


atacaatact 


gtacaataaa 


ttggaaggca 


gatgaatttg 


tgtagcttat 


tactattata 


18480 


aaagtaatcc 


ttcatgggga 


agtggctgta 


atcaagagac 


taggcattaa 


aagggataga 


18540 


aataaaatct 


gagcagactt 


gggatttgaa 


gttgaagaaa 


tttctctaat 


ttacctcact 


18600 


tctgtcaaca 


tgtcctctct 


cctagcatat 


tattgatatc 


atataacatt 


taagtctctc 


18660 


tctctctctc 


tccacccccc 


acttaatgtg 


ttgcctcttc 


cacttcccta 


gctctaacaa 


18720 


ctattataat 


attggagggg 


aatgtagacc 


aggttttgtc 


cttagggagg 


aatctgcccc 


18780 


aaagtgtcaa 


gactttcttt 


tgtcatttcc 


aggacctcat 


cctaaattac 


atttcttatt 


18840 


gtagcacctt 


ttctgctgcc 


atgacaacca 


tgcaaggaat 


ggaacagacc 


actccagggc 


18900 


ctggccctga 


tgtgccccag 


ctgggaaaca 


tagatgtcat 


acattcatat 


ctgtgtaaag 


18960 


gattgcaaga 


gaagttcttc 


aagaggaaac 


ccaaagtcct 


tggggtaagt 


caattgcaaa 


19020 


tctgggagaa 


gaccaatagt 


gttgtaaact 


cacgtttgat 


cagagctaat 


gcccaataca 


19080 


ctgtcccgct 


agcctcataa 


tacatcaacc 


agccctagta 


acatcagaat 


ttaatttcag 


19140 


gccctttctt 


tgaaaaaata 


tataatattc 


tatcacatcc 


tgcttccaaa 


ttaattatat 


19200 


ttaaaatata 


ttgacttcag 


atctattcta 


gggttgtggg 


tcagccatga 


taattatatt 


19260 


taacttcaat 


atagaagaac 


ggcaaaagaa 


aaggcccacc 


tgtccactag 


ctgactcccc 


19320 
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ctccacaggc cttcctgaca gctgatagat 
tttcttagcc tactgctcag aggatgaaaa 

5 

agtctcaggc gaacgtgtct aaaaacaata 
gtgaaagaac attaaatata ggtttatcac 
10 tcactgtgga ttctttggat tccttctctt 

atatagttac ttctcataaa atatgttatt 
aatgtatgga taattatttt aaaatatact 

15 

aacaattgag agcaaagtac aggtatatcc 
cttttcaatt atcaacttcc cccaccagag 
20 gttgacacat cattactcaa agtacacaaa 

gttctatggg ctttaacaaa ggcatagtga 
atcccaccaa tccctggcaa tcactaatct 

25 



35 



aaagtcatac 


agttaacatt 


ataaattagg 


ataacatgca 


tttaaatttc 


ctctatgttt 


gtactgaata 


atatttcatt 


gtctagaggt 


aaggacatct 


tgatggcttc 


caaactttgg 


gtgtgcaggt 


tttgtgtgga 


cataagtttt 


ttgtattgta 


agagtatgtt 


tagttttgta 


ttgcaatttt 


gaagatttca 


gtcaacaatg 


ctagcatttg 


atttcctcaa 


tgttctggat 


atctcattac 


cattttaatt 


ttcatttccc 



tatgtttatt taccatctcc atatcttctt 

45 

cattttttta attgggatgt tagtattctt 
tgaataacaa tcctttgtca ggtatgtttt 
50 atcttttctt ctgacagtgt cttctgcaga 

atgattaatt cggtatttca tgggttgtgc 
aaaaccaagg tcatctagat tttggcctat 

55 

ttcaacattt gggtctgtga tttattttga 
gtctaagttc aatttttgcc tatggaggtc 
60 aagactattc ttctaattgt attatcttag 



gtatattgct actctttctt tcttgcagct 19380 

atctgctatt gtctgagcat ttccttctgg 19440 

tagaattaaa acagtgacag gttagggaag 19500 

tagcccaagc ttcaaaccac .tgagaaaatg 19560 

tttttgcatt attcattttt ttggagagaa 19620 

tatgttggca catactaggt tcaattttta 19680 

gtatgtttta gagcagtttt aggcttaaag 19740 

catatgcccc ttgccccgac acatgcatag 19800 

tgggacattt gttccagttg atgaacctat 19860 

tcacaatggg gttcattctc tgtgttgtgt 19920 

tatgaatcta cctttatagt atcatacagt 19980 

tctatcgtct tcatagtttg ccttttccag 20040 

tagtttttaa agactgcctt ccttcaccta 20100 

tcttatggct taatagctca tttattttta 20160 

accacagttt atttatgcat tcacctacag 2 022 0 

caattaggag taacgttgct ataaatatct 20280 

caactcattc gggtaagtat gattactata 20340 

agaaactgcc aaactgtctt ccctgttact 2 0400 

aaggagagtt cctgctgttc tacataattg 2 0460 

tttttaccat tctaataggt gtgtagtcca 20520 

tggtgacata agatgtgggg catcttttta 20580 

tcttgaggtg tcttattaaa gtttttgaca 20640 

aatgttgagt ttgaatttcg ttgtatattt 2 0700 

ttgcaaatac cttctgtcag tctgtggttt 2 0760 

gcagaatttt ttaattttaa taaaattaag 20820 

ctttggtatt gtattgacaa actcatcacc 20880 

gtcatctgtt aagagtttca gagttttgag 20940 

gttaattttt gtgaaggata caaggtatgt 21000 

cagtttttta agtagtattt tttttttgga 21060 

ttctttatca aagatcactt gactatactt 21120 
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ctgtgtctat ttctaggctt tctattctat 
aatatcacac agtcctgatt actatcactt 
5 ctcctaaaac tccttcttct tcaagattgt 

tataaacttt agagtcagtt tatcaatata 
attttgagat ggagtctggc gctgtcaccc 

10 

cactgcaagc tctgcctccc gggttcacac 
gggactacag gcgcctgcca ctatgcctgg 
15 gtttcaccat gttagccagg atggtctcaa 
cctcccaaag tgctgggatt acaggggtga 
tgagggagat tgcattaaat ctacagatta 

20 

tgagtcttcc tatccatgga catggaatat 
ttttttaagt ttctcttata gattttgtac 
25 ccaaatacca atacctaaat atttaatttt 

tttaatttca aatttcactt atttactata 
attaagcttg tatcctacaa ccttaatata 

30 

gattctttca gatattctac acagacaatc 
tttttcctta ccaattgtat acctttattt 
35 cttccagtaa atggtaaaag gagtagtgac 

gtgagaaagc tttgaatttc tccattaagt 
attctttatc aacctaaaca agttatcctc 

40 

aaacaggcat tggattttgc tcaatatttt 
tttcttcctt ttagcctgtt gatgtcatta 
45 cagccttgca tatctgggat aaatcccata 

tattagattt agtttgcaca tgttttgttg 
gttggtctgc agttttcttt tcttgtaatg 

50 

tgctagctcc atagaatgag ttagggagta 
tggtataatt tcattttcaa gtgtttggta 
55 tctgttttga aaattaatta attatttatt 
gagtgcctat ttcttcttgt gtgaattttg 
ctcaaggtta tcaaattgtg ggcacagaat 

60 

attactatgg gaactgtagt gatattcccc 
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tcccttgatc 


tatttgtcta 


tttctttgcc 


21180 


tatagtttgt 


cctaaagatg 


ggtagtgtta 


21240 


ttttgttatt 


ctaagtcttt 


tgcctttccc 


21300 


cacaaataat 


ttgctaatat 


atatatatat 


21360 


aggctggagt 


gcagtggccc 


aatctcagct 


21420 


cattctcctg 


cctcagcctc 


tggagtagct 


21480 


ctaacttttt 


gtatttttag 


tagagatggg 


21540 


tctcctgacc 


tcgtgatccg 


cccgcttcgg 


21600 


gccaccacgc 


ccggccattt 


gctaatattt 


21660 


aattgggaag 


aactgatgtc 


ttgacaatgt 


21720 


ctctccattt 


atttagttat 


tcgacttatt 


21780 


atagtttgtt 


agatttatgc 


ctaacaaaca 


21840 


tagggttgct 


aatttaaata 


gtattgcatt 


21900 


agtacataag 


aaagtaattg 


actcttgtgt 


21960 


atcatttgtt 


aattccagga 


gtttttggtt 


22020 


acgtcatctg 


tgaataaaag 


cagttttatt 


22080 


ccttttcttg 


tcttaccatt 


tcagctatga 


22140 


aggcttatcc 


ctgccttgtt 


cctgatctta 


22200 


atgaatttag 


ctgtagtttt 


attaataaat 


22260 


tattaatagt 


ttttcaaaag 


tttttataat 


22320 


tcctgcatct 


gttgatataa 


tcatgtgatt 


22380 


gttacattca 


ttgatttttg 


aatgtctagt 


22440 


tggttatggt 


atataattct 


tttgatacac 


22500 


ggaatttttg 


catctttgtt 


caccagagat 


22560 


tttttgactt 


ttttttttgg 


tagataatag 


22620 


tttcctcttt 


ttctgttttc 


tgtaaagaat 


22680 


gaattccaaa 


cctttccagg 


cctggtgctt 


22740 


ttatttattt 


aataattata 


ggcttattca 


22800 


acaaattgtc 


tttaaggagc 


aattcattgg 


22860 


tgttcgtaga 


gttattttat 


tatcctttta 


22920 


ctttcactta 


tgagtaatct 


gtgtcctctc 


22980 
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tctctttctc 


ttagactggc 


taaaggctta 


5 


taaaaaaaag 


cattggattt 


cattgagttt 


atttctgctc 


taatttctat 


tacttttttt 




ttctttttcc 


agtttcttaa 


ggtagactct 


10 


atatattcat 


tcaatgttat 


aaattttttc 




tttgataagt 


tgttttaatt 


ttttgaccca 


15 


gctcctattt 


taagaacgct 


gtttactcat 




aatatcagat 


ttgctcttct 


ttttttcttc 




cttccccttg 


ctacaataat 


tgctttctaa 


20 


caaaagagag 


aaacagttgc 


ggtagaagta 




gctggttctg 


gcatactcct 


tttaaccgtg 


25 


gataatattg 


gtaagataat 


attcatggtt 




tatcaaggtc 


atatgactat 


atcaaatgaa 




gggagtggca 


taagcaccat 


ttttgtggtg 


30 




att ctgattg 


cct tcratQaQ 




tgcatttagt 


tcttatgaag 


aacatcccat 


35 


gtcagtgatg 


gtaagcaggg 


tattttttga 




aataccctga 


tctttgcaat 


tttattctaa 




tggtatctaa 


cctttttaaa 


ctcaattttc 


40 


acttaaaact 

C4- lw L4 U W ^-H *w 


aggcgattaa 


atact ttata 




attgccaaag 


acccaaactg 


gaatgactta 


45 


agattagggg 


cttatcattt 


ccagaattgt 




gactgctagc 


agactatacc 


agaggggaga 




tttatataca 


tatgcccaaa 


tcaactatca 


50 


taaactaatt 


aagatttacc 


tc tgtagctg 




aaaaactgaa 


gaaaatgatg 


gtttagttag 


55 


acagtttcta 


atatagagtc 


tactgactat 




ttaaattgtt 


atttgataca 


ttttgtttca 




ttaaattaaa 


attagaattg 


gaatttaagt 


60 


gagagttttg 


atgcaaattg 


aaagtaaaat 
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ttgattttat 


ttatgtattt 


ttcttttttt 


23 040 


gtctattaat 


tttctgtttt 


taattttatg 


23100 


tcttccgctt 


acttggagtt 


taatttgctt 


23160 


caggtgattg 


attttgatca 


ttattttcta 


23220 


taagcactgc 


tttcactaca 


tcccataaat 


232 80 


tttataatgc 


catgagataa 


ttgtactcca 


23340 


gttcatgtcc 


tcaggcccag 


acgtcaagaa 


23400 


ccttgtatac 


actccttttt 


cttcaggtta 


23460 


ttccttggcc 


acttttctct 


gagtgtgtga 


23520 


gcatcctagg 


tatcagaaaa 


gcacctctgg 


23 580 


aaatgattga 


cagtggtaga 


gatgctgaga 


2 3 640 


ctttatgtga 


aaaaaagagt 


gacaacagaa 


2 3 7 00 


aaagaataag 


gatagcttag 


tgattggaaa 


23760 


gataatcagt 


ttttgctttc 


atcatttcta 


23820 


ccttagcatg 


ggaataataa 


tgatgtgtgt 


23880 


ttttgtgtat 


gttgcgtaca 


caatttgggg 


23940 


tataattgga 


tataacataa 


tagaggtgta 


24000 


ttctatcttg 


taattctgta 


aatttgagaa 


24 060 


tcatatttaa 


aatagtaata 


ataatataaa 


24 120 


taagtcagtt 


aggatatatt 


acagcaacac 


24180 


agcaatgaaa 


gcactgattg 


tttcacaaca 


24240 


ttaatttggt 


ggtgtaagga 


tttcatcaag 


24300 


tggtagacac 


atctcatttt 


tgataaatat 


243 60 


gtaagggtaa 


tggacttgcc 


attactggct 


24420 


ggaggagtct 


acactacctt 


ttagctcagg 


24480 


caaggaaaaa 


gagatgtggc 


tggtaaacag 


24540 


caaatctcag 


gaaattgtta 


ccttagaaaa 


24600 


taagtttaat 


aacagtgact 


atgattaata 


24660 


ttaaaaacct 


tttcccaaga 


gaaaacacaa 


24720 


tgagaataaa 


ataccaagtc 


ttacctattg 


24780 
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acaaaagaaa aataaaattt atgaggcggg 
cattataggg tttccattta tttccaaaac 
5 agagcttaat ttttgctccc cttcactcgt 

gtgtattagt cagggttctc tagagggaca 
aaaggaagtt tattagggag tattgagtca 
ccatctgtaa gctgaggggc aatgaagcca 
tctgatgttt gagggtagga aacatgcagc 
15 ccagtctagt gtttctatgt tcttctgcct 

tagattgtgc tctcccagat tgagggtggg 
ttaatctccc ttggcaaccc cttcatagtc 

20 

atccaatcaa gttaacattc aatattaacc 
ccacacacat ctcctgaagt catacatgat 
25 caaaattatg cctaacataa tgcagctatt 
tatacaaatg ctattaccta aagttaacaa 
cttatgtcac atggtaaaga aaaaagaaag 

30 

aatgtataca tgcacaaaca tgttcttaac 
cggtcttcat ttctgcaact gatcacatgg 
35 taccaccgat tctgtattcc ctttgccttc 

ttttcctggt ggagtgacca aaaccttcat 
tcctgcccag attgggttgt tgtagtttcc 

40 

ctaagagaca ccctaaggga tctcctgtat 
agtagtagac taattttatc ttgatagtcc 
45 cccttcttag ctggttgact tagaggtaag 

aacttccagc ttaatgaaat cattgttgtc 
taagacctct aggccagcag aacatagtgt 

50 

taggtcacta ggggtgatgg taagtggtgc 
cccacaaatc ttggctatag cagaaacagt 
55 cacagccttc tggagaacat tgccccagcc 

taattgtgac ttcaaaagga cattccacca 
ggaaacatgg taagaccagt gaattccatg 

60 

caacaagtga gtgccttggt cagaggcaat 
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tttatgttag 


aatcaaggaa 


tctattctag 


24840 


tgagaatgag 


acagtttaga 


gttttataat 


24900 


gtacagcgga 


agtgaaagat 


ttgaaggaga 


24960 


taactaatag 


gatagatgtg 


tatatatata 


25020 


cacaatcaca 


aggagaggtc 


ccacaatagg 


25080 


gtttgagtcc 


caaagctgaa 


gaacttgaaa 


25140 


acgagagaaa 


gatgtaggcc 


agagactaaa 


25200 


acttttattc 


tggccacact 


ggcagctgag 


25260 


tctgcctttc 


ccagtccact 


gactcaaatg 


25320 


acatccagga 


atgatatttt 


gcatccttca 


25380 


atcaagagtc 


caccccttgt 


caacctgaac 


25440 


tttcaaataa 


agaaaataaa 


gacaaaaggc 


25500 


ttttgtacaa 


cccaaaatgc 


accaatcccc 


25560 


cacttaaatg 


ctaatatgaa 


gttaataagt 


25620 


aaaataaaat 


gaagatattt 


tcttagtaca 


25680 


aaaagaagga 


ggaaatactc 


atgacaatta 


25740 


ttgtagctgg 


tattgatgac 


agccttattc 


25800 


aggaagcacc 


tcagcaggtt 


gtgttttttt 


25860 


tccagaaggc 


ctagtctggg 


ccatttgtag 


25920 


cattgacctt 


aatcataggg 


catggtaata 


25980 


tccacacata 


cttccttacc 


tccattgtgg 


26040 


aggtcagtca 


ccctagccaa 


cactgtaact 


26100 


aggagcccaa 


aatggccagg 


tggcaatctt 


26160 


tcccggtggc 


aacgttcctt 


cctctggaac 


26220 


tgtggaaaca 


ggaagcaaaa 


attttgctgg 


26280 


cacttccact 


tatacccctt 


gatttctgga 


26340 


accttatatt 


agacgctgat 


tcagagcatg 


26400 


ctgaaaagta 


ttgtcacctg 


gttgatgttg 


26460 


ttctatcaat 


ccagctgctt 


caggatgatg 


26520 


agaatgagcc 


cactgccgca 


tttctttagc 


26580 


gctatgtggg 


ataccatgac 


agtggttaag 


26640 
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gcattccatg agttcttgga ttgtagtctt 
cccatatcca aagtaagtgt ctattctggt 

5 

aagaggtcta atataatcaa cttgccacca 
gctatattga gggctcagtg ctggtctttg 

10 catagccagg tcagtgaatg gaagtccatg 

gccaccatgg ccacttcgtt catggactca 
aggctgactg gtgtccatag aatgagtcat 
tgctgaggtc accgtttgat aagcactcac 
ccactcggag aggtccattc acatacctct 

20 atcatgcttc ttccaagtcc ctgactactc 

agtatataat tgcagatctg gccatttctc 
tgctcaacat tctgcccact gggacgattt 

25 

aggaaggggc tgtagtgctg cagctgacca 
gccatctgta aacagggccc ttgtcttctc 
30 atgagaccat cagtgcaggc tgggggagag 

gcatttgagc cacttcctca tataacttac 
cacatatatg ccacttccat ttgatgatgg 

35 

tagacaagtc agaaagcact caattcatga 
gacccatagc caaccgttta gtttctacca 
40 aaaaaggaaa gtagttatct gcagaagatg 
tctgctgtga ttcatctatg ggggacttcc 
ctgacacctc aagcacaatt agatctgctc 

45 

gcatagcagt ttggacctgt tgcagagcct 
aagcctttca ggtctcttga ttaatgggcc 
50 cctccaaaac ccaaataagc ccactaggca 

gggcaaatac agcaacttat ccttcacctt 
ctgaacccct agaaatttta ctgaggtaga 

55 

ccatcctctg gcatgcaaat gtcttaccaa 
actggatccc atcagtataa tggcatcaat 
60 ggaaaagtga tcaagatctc cctgaacaag 
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gccagaagca 


ttgtgtgcag 


gataggcaaa 


26700 


gaggaaaaac 


cgctgccctt 


tccgtgatgg 


26760 


agtagctggc 


tgatcattcc 


gaggaatggt 


26820 


ctgctggcaa 


attaggtgct 


cagcagtggc 


26880 


ttgctgagcc 


catgcatatc 


ctccatccct 


26940 


ttgggcaatg 


acaggggtgg 


ctggagaaag 


27000 


cctgtttact 


tgattattaa 


aatcctcctc 


27060 


atgaaataca 


aatatcttca 


cagtttttga 


27120 


tcccaaaatt 


tatttgtcac 


caattttcct 


27180 


agccaaacca 


ttggctatag 


cccatgaatc 


27240 


cttccaagca 


aagtgcacaa 


tgaggtgcag 


27300 


tccttcaccg 


ctgtccttca 


gggatgtcct 


27360 


ctttgaggtg 


gtgcctgcat 


atcgtgcaga 


27420 


ttcctgttaa 


ttgatcctgg 


ggaactccct 


27480 


aaggcagtgt 


ggcgggaatg 


gggatcatga 


27540 


ttgtgccctc 


aggagttgct 


caagcctgat 


27600 


aatgctgctg 


tgcatgcccc 


actttatggc 


27660 


taggtagttc 


aggtcgcatg 


gtgacttgat 


27720 


aagtccagta 


acagggcaag 


agctctttct 


27780 


gcagggcctt 


cctccaaaat 


cctagaggcc 


27840 


aaaggctcct 


aacagcatcc 


ctctctgctg 


27900 


tgttctatgg 


cccaagtgac 


agagcaactt 


27960 


tctcctgtcc 


tggaccccac 


tcaaaactgg 


28020 


aagtaacacc 


cccaggtaac 


gaatgtgtgg 


28080 


ttgtgagtct 


ttcttggttg 


tagggggtgg 


28140 


agaaggaaca 


tcttgacagg 


ccccacacca 


28200 


aggtccctga 


attttagtca 


tatttatttc 


28260 


taagtccagt 


gtgcttgcta 


cttcttgctc 


28320 


ataatggacc 


agtgtgatat 


tttgtggaag 


28380 


gtcatgacac 


aaagccagag 


gagagttgat 


28440 
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10 



20 



atacccctga 


ggtaggacag 


taaaggtaca 


cttctggtgg 


gccttataga 


caggaatgga 


ttaccaggta 


ccagaagatg 


ggttaatttg 


aactgcaact 


ggagtcacta 


cttggttaag 


tccatctgtc 


ttctgcactg 


gccaaatggg 


cacccttgca 


tcttttaagt 


ccttgatggt 


gggatattgt 


tgttgattta 


ctgcttttct 


ggccttttcc 


accatagtag 


ccttcactct 


cagctgctaa 


gcatgtctgt 


gtcaattatg 


tgagtgtggg 


gacccactgg 


agccactgta 


acatgacctc 


cataagcccc 


tactataact 


ggaatcaatg 


tcagctcaga 


gtcagtgtcc 


ctttctttcc 


ccaatccagt 


tacctggtaa 


gagaaagatt 


aacaacataa 


attgtcagta 



ggcctccttt tcattcaagg ggttctgggt 

30 

agtggggctg cgattctctg tttttataat 
aagttttctg cttatgtaaa ttaagtaaga 
35 taggaacgct gtaattaatt agccaatacc 
tgctgctttg cctcttctgt ctattatggt 
gtgctgccat ttggcccctg ctgtctctag 

40 

gtagttgagt gactacagtt tctactgtaa 
agctcttcaa ggatgcaagt gctgccctct 
45 gtatatcttc tgaaccctcc cagctgggat 

tcatcacaat ctgcctaagc ctttggatcc 
cattttcagc tcacccacag tgggccatct 

50 

taaactagaa cttttttaac tctgcaagct 
gggcccaaat caataaattc agcctgatcc 
55 cccttaatat tcattcccat gcctgttttc 

caagcagttc ttttcaagtg tagcacactt 
taggggcctg ctgggacttt agttataggt 

60 

tcctaaggag aatcaacatt gtcttgcttg 
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ttgctggtct tgccagctga aggtaaattg 28500 

gaaaaaggca tttaccaaat caatggctgc 28560 

ctcaagcaat gaaaccacat ctggtacagc 28620 

ctcatgataa tccactgtca ttctccaaga 28680 

agagttgaat ggggatgtgg tgggaatcat 28740 

ggcactaatg tctgcaatcc cttcagggat 28800 

aggtagaggc agctctaatg gcttccattt 28860 

atgagtcagg ggccagtgtg ggggtcctga 28920 

cattctggca ctggggaaat gaatgcagga 28980 

agttgaacct gagctaaagc tccattaatt 29040 

ggaggatcac aatgactttt tgggtcccct 2 9100 

agtagttccc aaaaggtctg atcatttttt 29160 

aaggccagag gtctccttga ggaaggatgg 29220 

gtatagtggg atccttcttt aagaggacct 29280 

ctgtaaaatg gcatatgtct ggaaattgat 29340 

tcaaattagt gttttgtcca ctcgacctgg 29400 

attcggtaag cttcctgtca atttcacttc 29460 

agagctctac acaagtcaga ctattctgat 29520 

agctatgcct acctttcctt tgatggctga 29580 

atccaattat ttttattgca tttaaatttt 29640 

gatctgacat acagagaaag gcaatcacag 29700 

caaatctatt tcacaaagta ctggtcaagg 29760 

gaataggctt aaagtgacta atccactcct 29820 

cttcctctac attaaaccaa ggagatcagg 29880 

tttaatccat atttcagcta accaagcaaa 29940 

gcaacattaa atgcagaatc cctgcttaga 3 0000 

aactctatgt tcttccacca ttatcccaca 30060 

cagatttctg cttatataaa ttagaaaact 30120 

tcttgtgtat cgcattctga acctcacctc 3 018 0 

ctagaagcaa acaggggcct tgggagtggg 30240 

gcaacacctg aggagaaggc catcactatc 30300 
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ggggaacctg ccccgatagt cacgtaggtt 
ttgagaaata aagggacaga gtacacaaaa 

5 

aagacatcac atgtcgatag gttccgtgac 
ttattaggga gtttcaaaaa gggagggagt 
10 gtactgtaca aggtaataga atatcacaag 

cacagtaccg gggtgaaatt aaaattgcta 
acatcttatc aggagacagg gttttgagag 

15 

ggaatttcct cttcctaata agcctgggag 
ctacagcctc gaccataaga gaccggcaca 
20 caggcgcgta ttctctttcc cagggatgtt 

tctcccattt gcttttgaaa gaagagaaat 
tcagaattta aggttctctc tcttattccc 

25 

caaggtgccc agatttcata ttgtttaaac 
caattattac agggtcctga ggcgacttac 
30 aaagtaaaga cgggcatagg aaatcacaag 
tgaaatcttt acaatttatg tttagagatt 
agtattaatt tggggaactc ataaatgtcc 

35 

catggcttca gccggtccct ccgtttgggg 
cattaggcag cgcagggttt atctccttag 
40 gttagggagg ggatgttgcc actactggtg 

aggctcctga gagtttacaa gctcagtgtc 
cccattccaa gttgcagggt cccattcttt 

45 

cacctggtga gtctatgcat gcaccttttg 
ttgtgtctga gtgtccacaa tttcagctct 
50 ggcaatctta gaagatttga ggctcagtat 

gagttcataa ttttctttca tcactttgtc 
aactccatcc ttaatattct gtttttctag 

55 

tcagggttct ctagaggaat agaactaata 
ctaataggat agatgtatac ataaagggca 
60 tagtctttcc atgtttttct gcctcctttt 
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cttttctgtt 


ttccctaagc 


gttggctggc 


30360 


gagagaaatt 


ttaaagctgg 


gcatccaggg 


30420 


accccacaag 


ctgcaaaaac 


cagtaagttt 


30480 


atgcgaatag 


gtgtgggtca 


cagacatcaa 


30540 


gcaagtggag 


gcagggcgag 


atcacaggac 


30600 


atgaagtttg 


aggcaccatg 


gtcattgata 


30660 


caactggtct 


gaccaaaaat 


ttattaggcg 


30720 


cgctatggga 


gactggggtc 


tatttcaccc 


30780 


tctggggggc 


catttatagg 


cctatacccc 


30840 


ccttgctgag 


aaaaagaatt 


cagcaatctt 


30900 


atggctctgt 


tctgcccggc 


tcaccggtgg 


30960 


tgaataattg 


ctgttatcct 


gttctttttt 


31020 


acacatgctg 


tacaatttgt 


gcagttaatg 


31080 


atcttcctca 


gctgacagga 


ttaagagatg 


31140 


ggtattgatt 


gaggaagtga 


taagtgtcca 


31200 


gcagtaaagg 


caggcataag 


aaattataaa 


31260 


atgaaatctt 


cacaatccac 


attcttctgc 


31320 


tccctgactt 


cccacaacac 


atcactgttg 


31380 


acaaaggtga 


aaaggctgat 


ggcagtgtgg 


31440 


atggggaagc 


tgttttttct 


ggcaaaaaaa 


31500 


cccagcttca 


tcaaggttct 


accgcacatc 


31560 


ttcaatcaat 


gctctcgctt 


taacagtaga 


31620 


ttgcaggtca 


gcaaatcgca 


tgataaaagc 


31680 


ttctctacag 


aagataagac 


tctcactcag 


31740 


ctgcttctga 


agcttagagt 


tagaatccct 


31800 


cagtgaactt 


aggagcaaaa 


ccaaaaaaag 


31860 


atcatctcct 


ggtaccaaaa 


tctgtattag 


31920 


ggatagtcag 


ggttctctag 


agggatagaa 


31980 


gtttattaag 


accagaagac 


taaatgagtc 


32040 


attctggcta 


tgctggcagc 


tgattgactg 


32100 
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aggatgggtc tccctttcct agtccaccaa 
tcacagacac acccaggaac aatactttgc 
5 tattaaccat aacagacagc aaacactgtt 

tacatatgat atataaccta tatatataat 
ttattaaact cttactatga tctggacatt 

10 

tttcacatca tcaccacaaa cttttgaggg 
gaagagaggt taagtaactt gctcaagatc 
15 atagggtcac tctattatca gaatcacata 

tccattttct tttcaagatc acattgaaaa 
tgttcagtag attaccaaga actgcaggag 

20 

ctcttgtagg cattgaacta gattggtaaa 
gctaccttcc attattcttt caaccttttt 
25 tttctccctt ttacagtata agtaccagta 

ttggcctctt cagttctatt cctatttatg 
agataccata tacatatttt ttcctgtggt 

30 

caaatacaca aacatacata cacacttcta 
ttttccaagg tgctttactg aattttttaa 
35 ttggatgttt gcaatatatg ttattcatac 
caactatttt tcttatataa ttattttggt 
tctacatgtt tatcttgatt ttgactttaa 

40 

aaacatgata aagacttgta gactgagaca 
tatttatatt ttaccaataa tgtttaagtc 



50 



60 



tatcctatca 


acactgtaaa 


catgagtagc 


tttattatat 


atattagaat 


atttcaatat 


ccaaatgttc 


atgatgtatt 


ttatgttaca 


tttccaatgg 


tgatttagat 


aacatgaatc 


gaagtttatt 


gagctatttt 


attagtccag 


catctgagat 


caaatttgta 


aaggaaataa 


gagagtccaa 


ccaacttgat 


tagctatttt 


ggattcagat 


gaagtagtat 


attctagtta 


tactatttta 


ttttataact 


ttttaaacat 
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ctcaaatgtt 


aatctccttt 


gtcaacaccc 


32160 


atccttcaat 


tcaatcaagt 


tgacactcaa 


32220 


ccatctctgg 


aaagttgcaa 


tatatatata 


32280 


atatatatta 


ccaatgtcaa 


taatagcttt 


32340 


ctgctaagtg 


ctttttatgt 


gtacttttta 


32400 


aggatttctc 


attatcttca 


ttttatggat 


32460 


acaactgttt 


ggtacaggat 


cagggttcaa 


32520 


catgtaacta 


cctaggtttc 


cgctttcatg 


32580 


ccagcttaca 


tggttttagc 


cttccacacg 


32640 


ctgtgtttgt 


ccactgagtc 


atcactcaga 


32700 


tgcagtaaat 


tggtaaaacc 


cttttttgct 


32760 


ttccgaacat 


cacctgcaaa 


gtgcctctct 


32820 


ccaagagcca 


gctgtaagtt 


tgtattatcc 


32880 


tttcacattc 


tttgttaagt 


ttttcctctg 


32940 


aaatgaaatc 


ccttttttca 


atcctatctt 


33000 


atatgtttat 


ctaatttctg 


tatataacat 


33060 


agagtttatg 


gtacttttta 


gtttatattc 


33120 


ttaaataaca 


gtagttttag 


ccatttgttt 


33180 


taatataccc 


caaatagtaa 


tacataatag 


33240 


taagatggtc 


ctactttttt 


tttaatcatt 


33300 


gatgtcctat 


cttgttacag 


aaagtataat 


33360 


actaataatg 


ttggattgta 


ttaaatgtcc 


33420 


aattaacagg 


taacgttttt 


cttaaattta 


33480 


taaaaatctt 


tgtagagttg 


aaggaacatc 


33540 


tgatggatga 


tttgctctac 


tgttattttt 


33600 


atctctatgt 


ctgggtattt 


ggaaactaaa 


33660 


tttctttata 


ttaggatata 


atttttgaat 


33720 


aaaactctaa 


taatggaata 


ttataaggta 


33780 


ggctaaatga 


aaaataattg 


ccccattgat 


33840 


tcaaataact 


gggaactaaa 


ttttatgtgc 


33900 


attttcaatg 


agaattattt 


tagtacattt 


33960 
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tttgtttatt 


gtattcaata 


ttaataccat 


5 


agctttatct 


ttcagtgtcc 


tcaaacagtt 


aaggcttgat 


aaattatccg 


gtggactcat 




aagagaaaat 


tatgtgcaat 


ctcttgcttt 


10 


atctaatt tg 


gaataaagtt 


tactaagttg 




tctccatttt 


tttctctcat 


tcttatttta 


15 


gaattagaac 


aacaaaaggt 


ctggtgagta 




attaatcaat 


aatgtattct 


gccagggtat 




aagtcaggag 


caactatgca 


ttttacttag 


20 


taatgataga 


attactgaca 


taataatgga 




tttatgaaga 


tatggaaaat 


tatttagaac 


25 


ctgcagcttg 


ggattcagcc 


catgacagtc 




ttgaccaatt 


gattaaaagg 


tagattctgg 




aagactttgg 


gaggcctagg 


tgagtgggtc 


30 


gccaacatgg 


tgaaacccct 


tctctactaa 




gggcgcctgt 


aatcccagct 


actcaggagg 


35 


ggcggaggtt 


gcagtgagct 


gagattgtgc 




actccatctc 


caaaagggaa 


aaaaaaaagt 




aattttcaat 


ataagaaatt 


cacaataaaa 


40 


gtaactc t ca 


tgtcaagatg 


cctttcataa 




acaagggttt 


ttctggctaa 


acactatttg 


45 


actctggagc 


atgatcaata 


tcggttttcc 




tgccgggacc 


tgcagccaca 


ctacactctt 




aaaaagcctc 


cttaccttac 


ccatgagaag 


50 


aaaaafcaa tt 


taaaatgtag 


aaaaacatcia 




agatcgggca 


ccagaagcaa 


ggaatggcag 


55 


tgtgtaaaaa 


atcattgaaa 


accctgataa 




ggaattatct 


gaagaattta 


agctggagaa 




gatcagagtt 


gaaaccacta 


taaggagaca 


60 


tagacagcag 


tgtaaagtaa 


ttaacagaga 



ataagaatta 


cttcttatca 


ggaatttgga 


34020 


ttaatatcat 


tccagttagc 


tattatttga 


34080 


ctggcagaat 


ttttctgttc 


tttctgggaa 


34140 


ctaagataat 


tgacctgttt 


atacatatct 


34200 


tgatttccaa 


tacttgttta 


aaacttttta 


34260 


ttatttcagt 


atccttatca 


gttgcagcag 


34320 


atgttttgtg 


gggggtatca 


aaaggaaagg 


34380 


aaaacaggtt 


ttgtttgttt 


gtttttaaaa 


34440 


caggatctgt 


ctcaaaaatt 


acgataaggc 


34500 


tattttgtca 


aggtagttga 


tgagggaatc 


34560 


aaatatgaag 


tgggatcact 


caaaaaagct 


34620 


acaccaagta 


ttggtgtggc 


ataatgttta 


34680 


ccgggcgcgg 


tggctcatgc 


ctgtaatcct 


34740 


atttgaagtc 


aggagtttgc 


aaccagcctg 


34800 


aaatacaaaa 


aaattaccag 


gcgtggtgac 


34860 


ctgaggcagg 


agaattgctt 


gaaccctgga 


34920 


cactgcactt 


cagcctggtg 


acagagcaag 


34980 


agattccaac 


tgtatccaaa 


aaggattttg 


35040 


ttattacata 


agaagcagca 


aatttaaaat 


35100 


agccaataaa 


atgcaaaagc 


caagatctca 


35160 


agttttcttt 


gttcaaaaga 


taaatatcaa 


35220 


aaatctggcc 


taacccaact 


tcctagtgtc 


35280 


cacactttcc 


tgtgtaattt 


taatacttat 


35340 


agaaagtaca 


aaaaggaata 


cacaaccgag 


35400 


gaggcaagca 


gtgtttagaa 


caatgataag 


35460 


ggaaggacac 


agacatgact 


gtccagctaa 


35520 


atgatttgga 


gattatctca 


tcttctcaca 


35580 


taatgggatc 


atatttgtgt 


tttggcgaga 


35640 


agttagaaga 


ctattgaaat 


aagataaagg 


35700 


tatatctgag 


gatgatgttg 


gattgcttaa 


35760 
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ttaattttca tttctttcta ggttggaggt 
ttggctatat cagggatctt aatcaatgca 
5 cattactgta accacgatca gttgtcaagt 

gtgagtgttt tctcttttca tgagatatta 
ctggcaacaa caataattat catttcctaa 

10 

ttgagaaata gaataatctt tcatttagga 
ttttcctcta tatgtgatga ggcaaaatgg 
15 tccttaatag ttactgggct gggcattttg 

tgatgcagtg tctactgtta tctcctccag 
gtcccagccg gacacggtgg ctcacgcctg 

20 

gcggatcatg aggtcaggag atcgagacca 
actaaaaata caaaaaatta gccaggcatg 
25 ggaggctgag gcaggagaat ggcgtgaacc 
agtgccactg cagtctggcc tgggcgaaag 
aaagatgtca gtgcctattc tcaaataata 

30 

agtggtaact cagtatcctt tcccacatct 
atcctcaaca tgtggcacct gaaaaattct 
35 ttgtaccagt tgtccttatt tcaatgctat 
tacacaacat tttgtaagta ataaaagttc 
gggaagagtt cttgcagcac atcacagcag 

40 

attattatta ttattgttgg agatggagtt 
gcagttcaca atgcacgtgg tggcacaatc 
45 tcaagtgatt ctcctgcctc agcatcccaa 

gcctggctaa tttttgtatt tttagtggag 
cttgaactcc tgacctcagg tgatccaccc 

50 

ggtgtgagcc actgtgccca gccaaatctt 
acaccaccaa actttcagtt acttttcctc 
55 cttagttctt ttcctcactg tgcccttccc 

ctaacttact agacagaatc aactgcattc 
agtcttgagg gctacgtagt cacaccaaaa 

60 

tttacttccc ttgagatatc taatgctttt 
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agtctaggaa 


agaatatcac 


cagttcagtc 


35820 


ataagcttga 


cgttttattc 


attccgttac 


35880 


aattgttaca 


tgactatgtc 


cattttaatg 


35940 


tcatagaagc 


agtttagggg 


agtgctggct 


36000 


ttgctgaatg 


actttggaag 


ttgtaagtga 


36060 


gtcaggtaca 


catctcatct 


ctgtctttat 


36120 


gaagaaagct 


tatagttata 


gaacacctac 


36180 


cataaatttt 


tctgtttaat 


tttaaaacat 


36240 


acccctctaa 


aaaatggcgt 


caaaaagtca 


36300 


taatcccagc 


actttgggag 


gccgaggcag 


36360 


tcctggctaa 


catggtgaaa 


ccccatctct 


36420 


gtggtgggca 


cctgtagttc 


cagctactcg 


36480 


caggaggtgg 


agcttgcagt 


gagccgagat 


36540 


agtgagactc 


tgtctcaaaa 


aaaaaaaaaa 


36600 


aagtaaaatt 


gtcttgaaca 


gtattttata 


36660 


ttggatgttc 


ttgcaacgtt 


aaccactttt 


36720 


caatctccat 


tctgaatttt 


actacttttt 


36780 


ctcaattgct 


ttattctgaa 


aattaccact 


36840 


ttgatgtttt 


gggccattgg 


gtaagtgtgg 


36900 


actaaaatgc 


acaatttaat 


ctttataatt 


36960 


tcactcttgt 


tgcccaggct 


ggagtgcagt 


37020 


tctgctcatt 


gcaaactcca 


cctcctgggt 


37080 


gtagctggaa 


ttacaggcat 


gtgccaccac 


37140 


acggggtttc 


tccatgttgg 


ccagcctggt 


37200 


acctcggcct 


cctaaactgc 


tgggattaca 


37260 


tattattaaa 


aagcctactg 


gtcattttca 


37320 


tctatgaact 


ttcagccttt 


cgccttgctg 


37380 


tcatgtgtat 


gtggaaactc 


aacactcttc 


37440 


tgtttacttc 


atttctagct 


aatcccacat 


37500 


taaccgcttg 


aggtgtcttt 


gcgaaactaa 


37560 


tatagtaagt 


aattttattt 


cttgccactc 


37620 
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actcaagata agaaaagaaa gtttcttttg 
agttagaaat tggcaaaagc aagattcaat 

5 

tatttgctat tgctatcccg aagaagaacc 
catagacttt aatggcacaa tgatagagac 
10 tgatttcgtg ggcatgggac ctgtgcagtt 

tgcttggtct aatgctctgc tgttgctatc 
acactctgta aattacgtag ctggttctgc 

15 

tgatgaattc ctattaactg tgtgaacatt 
cataagagct ttctggaatt tgaagcaatg 
20 cagtccacag gtctatgagc ccatagtgta 

acataagaaa gagaacagtt ccctaccaca 
atagaaatgc tggaaagcat ataattatta 

25 

aaagtagtta gagtctacca cccggaatca 
tcaactatac ttaacctaga aagacagcag 
30 taattttaat aagagagtct ataaattaca 
ttcccagagt ccatgatgaa ttcctgttaa 
agcaggggac ctccataagg gcttcctggg 

35 

gcacctgcac aatcagtcca caggtctata 
gtatttgttt ttgtaaataa agttttattg 
40 gtgtattgct gcttttgtgt tataataaca 

gtctgcaaaa cctaaaatgt acattatctg 
ctacacttct agatggtaaa atccttgaag 

45 

ctatattctg tgcctacaac aatgtttgaa 
caagaaaata aaaaagcaat gaatgcaaat 
50 tcatatcctg agctcttggt ctattggctc 

ggagagaaaa agagagaata tttaaaccag 
aatagaaata agtgtactgc ccaattgttc 

55 

gactagccct aaagccagtg ctctggggag 
attgttgtat tagcttctta ttgttactgt 
60 catataaaat catttagtgt ctcaagacaa 
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agaatttgac 


aaattttccc 


aatgtcactc 


37680 


tgtagaactt 


cttgtctcta 


atttccctct 


3774 0 


ttacttttag 


tttatgaggc 


tctcttcatt 


37800 


aaaatctgga 


attggccagg 


ttcatagggc 


37860 


actcaggacc 


tcagagttag 


aagagccctg 


37920 


ttgaaattat 


ttttaaaaat 


tttaaacaag 


37980 


attcatctca 


atcacctttc 


ccagaggtca 


3 804 0 


tttatgcctg 


ttctaatagc 


agaggacttc 


38100 


agtctcttct 


tgcctgggca 


cctttacatt 


38160 


tgccccatca 


cagggttggc 


accccggtag 


3822 0 


aaggaacttt 


ggttctcctg 


gaagaaaaag 


38280 


aagaggccag 


aaaaagttga 


gctctagaaa 


38340 


ttaggcagaa 


ccatatgaaa 


ttgccaatga 


38400 


tttcgtatag 


ctcaagctaa 


gatatatatg 


38460 


tagctggtcc 


tgcattcatt 


tcagtcacct 


38520 


ttgtgtaaac 


atttttatgc 


ctgtttttat 


38580 


atttgaagca 


acgggtctat 


tcttgcctgg 


38640 


tacatgctat 


gtttttggaa 


gtccagccct 


3 8700 


aaatgtaacg 


aacccattta 


tttacatatt 


38760 


ggtttgaatg 


tgtgtgccag 


agaccttacg 


38820 


gcctttacgg 


aaagcttgcc 


atcttctagt 


38880 


acaggggaat 


tgtctgtttt 


gtttgctgtt 


38940 


acacagaggc 


cgctatagct 


gttgaaggca 


39000 


attctcaagg 


tctccttaag 


tctgcctcac 


39060 


ttccgcaaca 


gagggactat 


ccctaaccat 


39120 


ataatcgctg 


catcagtccg 


gggtaaaatt 


39180 


tccaaatttt 


gactatgttt 


cttggctgaa 


39240 


atggattaaa 


gactgaccta 


cttgatgttc 


39300 


aacaaattac 


tacaaatcat 


taggcagaac 


39360 


tacagtttgg 


ttatcttctg 


tttctggagg 


39420 
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tcagaagttc aaagtgggtc tcactgggct 
tctgaaggct taggcaacaa taagttttac 
5 ctttcttggt ttgtcatccc atcctctacc 

ctctcttgaa ctctattctg cctcttgctt 
acccacctga ataatccagg aaaattctct 

10 

gctgcaaatt taatcctttt tgttgtgcaa 
cattcaaatc cccagaagct gaggattagg 
15 attactctgc ttacaatcac gaagatatgg 
tctagtactc aataggatat gatgacaatt 
gcactttcat cttccatgcc cacaatagca 

20 

ctgtgattat tttattagtg aaaaaagtga 
aagggtatgg ctgccattgg gaggcaggtt 
25 atgacattgt ggttgctggg aaatgttttc 
gttatgagtt ttctcctagg gtacggatgg 
ctgcattgct gtgtccctct ctgcctttgg 

30 

ggtcagtatt ggccttcatt tgaaggtatc 
ttaaacaaac ttatagtctt aacacagtac 
35 aagagtctgg gcccagtgga tgtgggtcct 

ggagttggcc ggggctgtgg ccttgtctaa 
cggaattaat ttcctttaag ttgaggactc 

40 

aggagagcca gagtctctag tgctttaatc 
ggtccatcca gaataatgag tttttttata 
45 cgggacttgt atcccatctt atacacaggt 

aggggtcagt gagtgtcacc ttagaactct 
tttccccttc ctttctcaaa gggcataacc 

50 

tgaagaggtc cccaaaagta cttggtttta 
ggaatacgga aggaatgtca tgatctcaga 
55 ctctcttacc ctgagacctg cagcagatgc 

cagagagagg aaagtgagga atggaaggag 
aatggggtgg gatatcctcc tagaattgag 

60 ~ 

ccccaacagc tactcattag cttcattacc 
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aaaatcagtg 


tgggcaaggc 


cgtactcctt 


39480 


tgccttttct 


ggcttctgta 


ggctgcctac 


39540 


ttcaaatcaa 


gcaatgcagc 


atcttcaaaa 


39600 


ccacttataa 


ggattctgtc 


attactttga 


39660 


cattccagct 


aataagcaac 


cttgatttca 


39720 


aaagaacaat 


ggtcccccca 


gggaggtctg 


39780 


ggactaggat 


gcagacctcc 


ctgggggacc 


39840 


aatcatagta 


ttttctcaat 


gatatgagtt 


39900 


atctttatac 


taccttccta 


acattttaaa 


39960 


ctgggaggtg 


ctgaactagg 


gaataaatgt 


40020 


gaagctactt 


cagcagatca 


agaattggtc 


40080 


tcctgactcc 


cagccagtgc 


tttttctctt 


40140 


acttgtcatc 


tccatttctc 


acattccctt 


40200 


catggtgctc 


ctcttaagtg 


tgctggaatt 


40260 


atgtaaagtg 


ctttgttgta 


gccccagtga 


40320 


tgtattactt 


ttgtattgct 


gtataatcag 


40380 


acattcatta 


tatttcagtt 


tccatggatc 


40440 


tggttcagga 


tcttatgact 


ctgtcatcag 


40500 


agcttctgat 


tctctttcat 


gtgaatgtgg 


40560 


atgacaatgt 


actttttctt 


tgagaccagc 


40620 


ctctcataaa 


gcattcacat 


gaagagatga 


40680 


aactgaaagt 


caacaaatca 


gtcctacata 


40740 


ttcatccaca 


cttgagagaa 


gggaatcaca 


40800 


gcctagcaca 


agagctaaat 


aaaagctctt 


40860 


tctgaatcca 


gataagaaaa 


tatttcacaa 


40920 


ttctattatc 


ctattacaac 


aacttttcct 


40980 


cttctctatc 


agtctgtgcc 


cacctgccca 


41040 


tcctggcagt 


taagccaaaa 


gaaagttaca 


41100 


agaagaaatc 


aagaaagagg 


ctctaagaat 


41160 


ggttccagaa 


tgaggaaatg 


gatagtaaaa 


41220 


caaagtctga 


aaggcagaca 


aaaccgtcag 


41280 
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ccaaagaata aagtaattgt agtttgagga 

tttgtgcctc tctgtcttct ttagggattc 

5 

accctccctc atagaagaga gattgactat 

attgtaaaac cctagaggtg agagattctc 

10 aagtgaagta actttcatgt catgatattt 

gaagctgggg aatgttgttt ccctaatccc 

ggggcatatc ttggctaaat ctctatctct 

15 

aatgttgcac tgagtttgtg gaagaataac 

tgctattcca ggccagtgtg gtttttgcaa 

20 tgggtatgtt gatgtcatca gcatgaacaa 

catgatacct tggtttaccc atcatattgt 

cttgcgagac cagggcagtg gtcatcagca 

25 

tgtcctaaat ctagcctgat ctaaagaatg 

aatgaatcag agaagaaaag agtaaactct 

30 gggcactctg aggtgatttg ggtgactcac 

caccatcaaa ttctcacatg gcagaaatag 

gccaccaaaa gattaacaga agaatgctcc 

35 

ccacatgaga aagtaccaga atccaactcc 

atatggaatc caattatgac ctctgtgtgt 

40 atatatatat acatatatat atatgtgtgt 



45 



tttttcccct 


ggaactcaac 


aactaatttc 


aaattaataa 


ataatgcatt 


taatgaggca 


aggactttat 


ataaaggcat 


taaactggca 


tattgctaaa 


acgaggtctc 


catgcaaaca 


ctcacttact 


tgactttttt 


tctgccatat 


agtggagatt 


catgcccttc 


tcccccctcc 


agtaccttgg 


aatagtatag 


tcagtccttg 


tatacatgga 


gtaaaaatca 


tatgaaggat 


tcttcctctc 


ccttccccca 


ccttctgctg 


cctgtgtcac 


atatggaaat 


gtgcaacata 
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gattgtttct 


ctggtgccct 


tctccatcat 


41340 


ccaggcaata 


ccatatccca 


ctccccgaca 


41400 


agcgcattaa 


ctctagtggt 


ggggttttgc 


41460 


agcaattacc 


aatgattttg 


taatttcatc 


41520 


aagtctgtgt 


tccctttggc 


tctttgcata 


41580 


ctggtttgag 


tccagaaaag 


cagtggcttt 


41640 


tcctggtccc 


ccaccgtgcc 


ctaccaatcg 


41700 


cagaactttt 


cttagttttg 


tgatgatacc 


41760 


tgctttattt 


gtaaccagca 


aatcatgatt 


41820 


cctggtttgg 


aagaagagcc 


ccatattttt 


41880 


gcactgtcat 


ccacatgtaa 


gattgatcct 


41940 


gctcaggtca 


agagctatca 


ctgaatgttg 


42000 


actgagaatc 


cccacatgac 


tgatattgag 


42060 


gataatgatg 


actttatgtg 


ggcaggaaag 


42120 


ctatggcatt 


ctgattgttt 


gtgctaatta 


42180 


catctcccac 


accacttaag 


acagtttgat 


42240 


agaaatctat 


gctgactgta 


acacaagaac 


42300 


aatactgata 


gacatattga 


tatcattatt 


42360 


gtgtgtgtgt 


gtatatatat 


atatacatat 


42420 


gtgtgtatat 


attcaaaatt 


ttgttctcat 


42480 


attggccctt 


tatcgagagt 


actagaagtt 


42540 


gcagcacttg 


aaaggttttc 


attcatcatt 


42600 


aataagattt 


ggaagcagaa 


gggcaaaagg 


42660 


catacttctg 


ctcccctgta 


taacattcct 


42720 


ttggggacca 


aagtgctttt 


tccttcatga 


42780 


ttttccttct 


gctttccttc 


acccatagaa 


42840 


catgtgcaca 


agctatcatt 


tcagtaaagg 


42900 


cagattcaac 


ttatattttc 


tattttttct 


42960 


ggcagaatta 


tatcttaatc 


aaatgtgtat 


43020 


tggtatttgt 


taatgtttgt 


taattacatt 


43080 
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tgctttttta ttgcagagca aaaataaaat 
ttctattcag atgcaatatg atggaaagag 
5 gtggtaaata atgtggactt tctaatcaga 

ctttgttgtt tactacctat gtagctatga 
tttctcttca gcaaaatagt aataatattg 

10 

gtataataaa gatagagcaa ttagtactct 
aatacatata catatgtaca tatatacata 
15 tcagcatata tttgtaaagt atctacaatg 

atagcggtga ataagagaga taatatcctt 
gacacagatt aacataaaac aatcaaaaca 

20 

gaattaatgt gctggagagt ggttgttggt 
taaccaggaa agttcttttt gaggaagcga 
25 gaatttgcat ggacattcat attttccaac 
cactcacgta tctattgtat ccacctttct 
cctccttctt tcaatgctaa cctgtctact 

30 

gaaaaagtct attatgacat tctatcttct 
aaccaccaaa tagaacctta atactgttgc 
35 actattgcat agatgttcta tgtctcttat 
tttattactg gcatacgtgt gtgtctgtaa 
gcagaaatag caataaatgg aaatgaaatg 

40 

tgtcacaggg ttcatatagt agcatgatgg 
agcttcaagc ccaacaggaa agaggaaaga 
45 aaaatgatga gagtaatttg attggaaatg 

aggtcacaaa gctaccacag aacttcactc 
gagcagaact gagaagcgcc aagtgaagga 

50 

aagtcaaagt ttacctcaag ttgggccaat 
gaagatgatt aactgctata actgggcaaa 
55 ataaaagcca gaactcatac taaatcctag 

gtcttggatt ttgtaaaata tcctaactta 
aagtggcctg agaatgcctt ctgttggttt 

60 

taacacttac ataagcaacc taagattagt 
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tagaagcaat actttcatgt gctacctcct 43140 

actaaatgtt gatgtcgaca gtactgtatg 43200 

caaccctggg ttttcatctt gctatcccat 43260 

acatgccaca aaatttatct gtgcttcata 43320 

tctggcccat agacctatga taatttcttg 43380 

ggtacatatg gagcctcaaa agctttaatt 43440 

ttatttatat ttgtcagttc atcattttat 43500 

gggcagacat tgttttaagc atataagaaa 4 3 560 

cctcttgtag gttaaacttt tatgggacag 43620 

tttttatatg ctgttagaag ctaactgtaa 43680 

agtggttgtg aaatatgagt cattagagag 43740 

tattttcatt atgataagat aaaaagtatt 43 80 0 

tcagtatata ggatttctca ttctgcccta 43860 

ccatgcacgc ttttcatggg aatgtgaggc 43920 

ttctccagct ccggagatgg acagtatctg 43980 

tggctgagga ttgaaaggta atctagcaag 44 04 0 

ttactattat gtagtatata gaaccttaat 44100 

gggacatcag tgagaataca tgtagcctac 44160 

aggaagtatg tttaaaagaa agtcaaataa 44220 

aatgcttgat ttcatggatg agcatctgtc 44280 

ctgctgattg cagcaaactt acctcttcac 44340 

gagggagatc atttcttaga cagtgaagtg 44400 

tgtcagtcat tcacacaatt aatttttcct 444 6 0 

tagttgagtt gatgtaattt gctcacagac 44520 

tcagacccta gcaaaatcct ttgcatctcc 44580 

ccctgacgtt ttaaatatat aaaggcaagg 44 64 0 

atccttcaat ggcttcctct ctaactcata 4470 0 

aagtccttac atctcctgat tctttgttct 44 760 

tatcctgata tattgccttg agattatttt 44820 

aaattagcgc atacataaac tttttaaggt 44880 

acatttttag ttgtaaaaag ctaatgcaaa 44940 



-171- 



WO 02/062946 



PCT/US01/48437 



tattttaggc aattcatgca aaatattatg atttcaactt gtagagaagg ctgcctcatt 45000 

tatttcatat catgcaattt taaatttatt ctttatttgg ttattctagg ttctatctat 45060 

5 

aagtgattag gatgtaaagc agtaggaaaa cgtccatgaa aactttgcct tcttttgcta 4512 0 

catttacaag aaaaatagtt attcttttta tttatttatc taatttttga gacagggtct 45180 

10 ggctctgtca cccaggctgg attgcagggg tatgatcttg gctcaatgca acctccacct 45240 

ccttggctta agccatcctc gcacctcagc ctcccaagta gctgggctta caggcataca 45300 

ccaccacacc tggcttattt ttgtattttt tgtagagaca aggtttcgcc atgttgctca 45360 

15 

ggatggtgtt gaactcatga gctcaggcga tccacccact ttggcctccc aaagtgctgg 4542 0 

gagtacaggt ttgagccact gcacctggcc aagataatta ttctttacat acatgattaa 45480 

20 attccccaca acttactagc tgtgggtagc ttatcattgt gtttcttcag tttcctcatc 45540 

tataaaaatt acaatagtac ctaccttata ggctcttaca aagactacag gtattattaa 45600 

aacttgtaat cacagaatga agattgacac tggtaggcga tatgaaaaga atggtgagtg 45660 

25 

atactgatag gtgacatata aagaattatt aaacaatata aaagcaaata acaaagtaaa 45720 

aataaaataa taaaataagc taaagattaa cataagtctt aatttgagca cataagcagt 45780 

30 tattatactt ttaaatattc aggaaaagaa acagataagt gtgtccacag aattctgatt 45840 

tttaaataac tgtactaagt ataagagtaa tgtccatatg ggacacaaag aagctgcact 45900 

accctgaaaa tcggtaattt tcattaactt cagtgaagtt ttagaagtgg gaatttataa 45960 

35 

tattactgtt tcaaatttaa taccagaata atctaagatt ctctaaaaat ggagtcatct 46020 

tccctcaaac ctattttcca gtcattgtga gttaggtaaa atgagattcc aactgtagta 46080 

40 cagttttcca ctgtcaatag taatacgcaa ttgggataaa gtagtggatc tttttctcta 4614 0 

agagaggagt atttaacttc tacagatgtc atattgagaa tgacatggca gaacaggcac 46200 

ctggagtcag gtggaaagga tgatcctcct acaagtaatc agctgggcat gcatatgata 46260 

45 

taaatcaatt ttgtgtttag gaaaaactca gtgtgggtca cttagtggct ttgatgtata 46320 

tttggaaact aggactgttt tcggtttggg taatagctct cagttggcta tctgagagga 46380 

50 tttttaggac agaagaaaca tagtagggaa caagaaaaag actataatct aagccaatgt 46440 

agtgtaacac aggtgcatgc attcagataa ttttagaatg ctactgacca gattgaatgg 46500 

caagctccca tacattctat atacaaaaac atcagaaagt tatttaaaaa tgtgtactct 46560 

55 

gtcattaatc ttcattgtaa acacattcat agcaccaaag gataggtttg ttgtggaaat 46620 

atagcaacaa tttgccaaca gcacacacca taaatcttct tcaaatcttc aagagatagg 46680 

60 tcctcaccta caccaattcc tttcctttag atcatctcaa cttgtattaa taagacgtag 46740 
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aggcaatttg attctctgtc tgcttagtga 
taggccaaga attgcctttg tattggctgg 
5 caattagtga cagtggttaa ccatcaattt 

cataaggtct tgagtatata aggtgatatt 
cctattcagg tagagtcagt atacttggga 

10 

ttatcgagtg ctcaatggtg cccaggctgg 
acctccacct cccagccgcc tgccttggcc 
15 cggccgccac cccgtctggg aagtgaggag 

cgtgaggagc ccctctgcct ggctgcccag 
tgccatccca tctaggaagt ggggagcgcc 

20 

gaggagtgtc tccgcccggc cgcccatctt 
gccctgtctg ggatgtgagg agcacctctg 
25 agcgtctctg cccagccgcc ccatctgaga 
ccgtctgaga agtgaggagc ccctccgccc 
gtctccgcca ggcagccacc ccgtccggga 

30 

gctgccccat ccgggaggga ggtggggggg 
ccgggaggtg aggggcgcct ctgcccagcc 
35 cccggccagc cgctccgtcc gggagggagg 

ccccgtccgg gagggaggtg gggggttcag 
a 999 a 99t99 gggggtcagc cctctgcccg 

40 

gggttcagcc ccccacccgg ccagccgccc 
cccgcccggc cagccgcccc gtccgggagg 
45 tgggaagtga ggagcccctc tgcccggcca 

tcattgagaa cgggccatga ggacaatggc 
ggggaaaaga ttgagaaatc ggatggttgc 

50 

agacttttca ttttgttctg tactaagaaa 
gtgaccttac ccccaaccct gtgctctctg 
55 tggattaagg gcggtgcaag gtgtgctttg 

ttaagagtca tcaccactcc ctaatctcaa 
cgcagggtcc tctgcctagg aaaaccagag 

60 

tccctccact attgtcctat gaccctgcca 
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ttgcaagtgg 


ctgctctgct 


gggttgaagt 


46800 


tatgtacatg 


tctgagagca 


agggctctgg 


46860 


gctctgtgaa 


taataaggat 


aaaaacaagt 


46920 


actgataact 


tctaactata 


agtagtccaa 


46980 


ctaaaaatga 


tttatttttt 


aaaaaagtat 


47040 


agtgcagtgg 


cgtgatctcg 


gctccctaca 


47100 


tcccaaagtg 


ccgagagtgc 


agcctctgcc 


47160 


cgtctctgcc 


tggccgccta 


tcgtctggga 


47220 


tctggaaagt 


gaggagcgtc 


tctgcccggc 


47280 


tcttcccggc 


caccatccca 


tctgggaggt 


47340 


ctgagatgtg 


99g a gcgcct 


ttgccccacc 


47400 


cccggccgcg 


accccgtctg 


ggaggtgagg 


47460 


agggaggaga 


ccctccgcct 


ggcaaccgcc 


47520 


ggcagccacc 


ccgtctggga 


agtgaggagc 


47580 


gggaggtggg 


ggtcagcccc 


cgccaggcca 


47640 


gtcagccccc 


cgcccggcca 


gccgccccgt 


47700 


gcccctactg 


ggaagtgagg 


agcccctcta 


47760 


tgggggggtc 


agaccccgcc 


cggccagccg 


47820 


ccccccaccc 


ggccagccac 


cacgtctggg 


47880 


gccagccgcc 


ccggccggga 


gggaggtggg 


47940 


cgtccgggag 


ggaggtggag 


gggtcagccc 


48000 


tgaggggtgc 


ctctgcccag 


ccgcccctac 


48060 


ccaccccatc 


tgggaagtgt 


acccaatagc 


48120 


ggttttgtgg 


aatagaaagg 


ggggaaaggt 


48180 


cttgtctgtg 


tggaaagaag 


tagacatggg 


48240 


aattcttctg 


ccttgggatc 


ctgttgatct 


48300 


aaacatgtgc 


tgtgtccact 


cagggttaaa 


48360 


ttaaacagat 


gcttgaaggc 


agcatgctcg 


48420 


gtacccaggg 


acacaaacac 


ctcggaaggc 


48480 


acctttgttc 


acttgtttat 


ctgctgacct 


48540 


aatccccctc 


tgtgagaaac 


acccaagaat 


48600 
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ggtcaataaa 


aaattaaata 


aataaataaa 


5 


ttttttcctt 


agacttactg 


aagcaaaatg 


catatatttc 


tgtccccttt 


atcctctaaa 




caaatagtat 


aagaaacgta 


atgatactaa 


10 

1 \J 


LLay o octet uy 


afaaarraaah 
ct Uctctcty dcici i_ 


y y y l. a. a. L.v^dd 




gaattgctgc 


aaaaactgct 


gtctattgag 


15 


tatatttata 


aaatcaaatt 


aaatccactt 




aatcattgtt 


acaacagtta 


taagaggtat 




ctgaatacta 


tatgcaaagt 


caggagcaaa 


20 


y LLaLL^L L.ct 




faa ai-t* ClCiZK t" 

Od.ci.ci L- Lyyou 




catctcgggt 


gttcacaatg 


accattaaaa 


25 


gttttcaaaa 


tttgccattt 


tccacaactc 




gtaaacttgc 


aatactctgt 


ggaaaggatt 




gaacatgaat 


taaaccagaa 


aaaaaggaaa 


30 

\J\J 


ClCt L. ClCt L. Ly d L 


yL/dLLL,LCtLL. 






tcttgttatt 


aatttttatc 


attaaaatgt 


35 


ttatttttct 


tcattattaa 


aattgatttg 




ttagaattag 


catgtcaaca 


tatccaatcc 




ggtattgaat 


ttataggtct 


ataagggaag 


40 


Cl Cl L. O V_ Ct C U Cl Cl 


O- Ciy Cl d ^ Cl LQO 


t-tj-ttaatta 

i— l u L l aa l uy 




ggttttttca 


ttttctgtta 


acatgaggaa 


45 


ctggtactaa 


attttctcag 


atctttttgt 




taagttttga 


ttaatatttt 


caaaggatgt 




gtactctaat 


aatgtcttat 


gactgtcttc 


50 




1-ahrl-htata 
l- cl l- o i— i_ y 


ci l-cici '-■y ciy <— , — ■ 




tttcttactc 


atttttagca 


acctaattaa 


55 


catccagctt 


gtggctagtt 


aggctgtttg 




ggaaattcca 


atgatttata 


caaatatgtt 




ttacatgtta 


gactacttca 


tactgtccca 


60 


ttcctactct 


ttgcttcaga 


ttatagagat 
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taaataaata 


aaaagtattt 


atccagtatt 


48660 


aagacttccc 


actaagctaa 


atttgtgtat 


48720 


atctaatttc 


tcttcactca 


tcctatatat 


48780 


aagattaaag 


aatattcaga 


cgtgattttt 


48840 


taatgagcaa 


tttgtaagtg 


tactataata 


48900 


aaaacatgca 


tccaactatt 


aaaggtaatt 


48960 


gtaaatgtaa 


gagaagcctc 


aaagggcttc 


49020 


ttctatattt 


gaggttgtaa 


aataataata 


49080 


tcaacagttc 


taacagtcta 


acagtgatgt 


49140 


catagagcat 


gaaacagatt 


caggaagaat 


49200 


atctgtagtt 


gaacttagtt 


taaattaact 


49260 


ttatttttct 


tgattctgag 


tcctgatcct 


49320 


tcaagagaga 


tatactttta 


tctttaataa 


49380 


tgagttgaag 


gccactgctg 


tggctactga 


49440 


actggaaatt 


tcctgtttct 


ttggtttgta 


49500 


ataatagctt 


tatagtagca 


catatctatg 


49560 


gccattctaa 


gtctttattt 


catgtaaatt 


49620 


taaaagacct 


tcctgaattt 


tgattaggag 


49680 


aattgctagc 


ttaacaatat 


taagtattct 


49740 


atttaggcct 


tcattaatat 


gtctcagcaa 


49800 


cgtcctttga 


cattcattgt 


attgattttt 


49860 


ctgttgtctg 


aatacatctt 


tattttacct 


49920 


ggatcccatg 


ttgacagctt 


ttgccttcaa 


49980 


tggcttacat 


aatgtcttat 


gagaagtctg 


50040 


cttttccccc 


tctgctgtta 


aaacattctc 


50100 


gttatgcttt 


ggtgtagttt 


gatttttctt 


50160 


gatctgtggg 


tttatacttt 


tatcaaattt 


50220 


ttaaattcat 


gctctcttct 


tggactcaga 


50280 


ctgatgcttt 


attaatgttt 


ccagactttt 


50340 


tctatttcca 


ccttccagtt 


cactaacatt 


50400 
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ttcttctgga aaacttaatg tgttgttact cataggcagt gatttattca tttcatatac 50460 

tgtagctttt atctctagaa gttctatttg attttttata tttttatact ctttgtgtcc 50520 

aagtatttct tgaaattctt atgcacattt ataatgaata ttctaaagca cttctctttt 50580 

attgtaatct ctataattta caggtctttt ttattaattt ttctcctggc tatttgttat 50640 

tgtgattttt ctgtttcttg ccatgtatag taatttggga ttagatgctg gatattatga 50700 

tttttatttg attgagtacc caaatttttc ttttttcaca agatattcca ttttatttct 50760 

tgcaggaaat taaacttaaa gaccaatttg attcttttga gacttttaca aaggcatgat 5082 0 

15 cgggagggtg tattagtccc ttttcacatt gctataaaga actacctgac actgggtagt 50880 

ttataaagaa aagaggttta attgactcac agtttcacag gatctacaag aggcatggct 5094 0 

ggggaggcct caggaaactt acaatcatgg cagaaggtga aggggaagcc agcatgtctt 51000 

20 

atgcagtgga agcaggagga ggagagagtg aagggggaag tgctacacac tttcaaacaa 5106 0 

ccagatctca tgagaactca ctattatgtg aacagcaaaa ggagagtcca ctcccatgat 51120 

25 ccgatcacct cccaccaggt ccctccccca tcattaggga ttacaattca acatgagact 51180 

tgagtgggga cacagagcca aacagtatga gaaggtttat cattgttatt aattgggggt 51240 

atgctttact aataatttgt ttattattaa cagattaatt attaaagcat taataataat 51300 

30 

tgacttatta ttaataacct ttctggaggt ctctattcta tcccccaagt attcatggaa 51360 

gtctctgcat tccagctgat tggaacttga atgtctccca gcattgtgtg agccctggta 51420 

35 atcatacagc ttacaactac ctggctagtc cttgactgta cttttaaaat ttcaacctat 51480 

aaaggtacag ctttgtattg gacaacaaat ttaggatact tgcagatttc tggaactctg 5154 0 

tctctggaga gctatcccct ttattgttct cttctcatta attccagctg tttcagtgcc 51600 

40 

ctcaaattct ggtttttgtc ttatgaattc agcaatatca gaataaatct ctctccctac 51660 

catggtatgg aaagtttctt catgcagaaa accagagcaa ttattatatt cactttgttt 5172 0 

45 tttcacttgt caatgttgaa attcctgcac taactttgtc caatgtccgg aaagagttgt 51780 

gcttattttt ttccagtttt ctttatgatt taaaaaattt tattgagaca agatcatcca 51840 

ggttcaagtg cagtggtaca gtcataggtc actgcagcct tgaaatcctg ggctcaaagg 51900 

50 

atcctcccac ttcagcctcc caaagtgctg ggattagaag catgagccac tgggcccaac 51960 

ccagtttttt aactgtacag agtggaagat taagtctggt tctatcataa ttggaagagc 52020 

55 tgagcttcaa ctttaaaaag atgctcatga atgtaaatat tagcattttt atttttcttt 52080 

atatttaaaa tttacttaag cttgtcaaca tatgtaactt aattaaatca atacttctaa 52140 

tttaaaatat ccaagcagaa aatttaagta ttatcatttt tatatgctgt tatttttagt 52200 

caccttattt aagtttcatg aagtagctat taaaagaccc tagcacattc cagttatttt 52260 



60 
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atatttaagt taatttgtga tattttccaa tgttttatat agtaatatcc agacagtttt 52320 

aatatatgct caaatgtctt ttcttcacac aacaaaaaat acaagaaata taataagaca 52380 

5 

aagttgtaca cattccaggg aaaatattta aatcttttaa ttaattaaaa tatgtccctt 52440 

ggtttaaatt ttgagtttac tgtatgaaaa acacaggaaa gatgtataaa gcaacatcct 52500 

10 tttgattgcc tggtggatag aaaaacaaaa aacaactaaa tttgagatac agtttgaaca 52560 

aagctgacaa tgcttttgcc ttcctctttg aatctcttca agatcagcaa tttatttttt 52620 

tactgactca ggtttcttgg gattgctgct gctcatgtga ttcccaggaa cttaactact 52680 

tgtgaataac tctgtcaggc tgataatact agattaagaa aacaattcat cttcaataag 52740 

tacactcttt ctagcagaag aagcagttca gaaagttcct tgtggtttcc cttctgaatc 52800 

20 ttttcatcgt caatttcagc aaagcgtggt attttccccc agtaaaaggt caagatagct 52860 

tgttttaatc cttctcagac tctaggtaca gtttcaccag acacagtggg taacttgctc 5292 0 

tcatctctta aatggatgct ttcagcttgg ccaggagggg ctggacatca agaggaggaa 52 98 0 

25 

agtcgccgtc tggatggggc tcaggaatca catagtgctg tattagctag ctgtcatcta 53040 

tgttgattct ttttccctta tgccagcagc aaagtgcttc aacccttgct gattacttcc 53100 

30 agtaatttat tttttatctt aatttttttc tacttagtat tccctcatat ttggaaaaag 53160 

atgaaaatat ttccacctta cctccagctc tgaagccaca gacactgtgc ccctaacagg 53220 

gtcaccaaat tgtctaaata acatacatgc atatatatat atatatatat atatatatat 53280 

35 

atatatacac acacacacac acacacacac acacacacac acacacacag gcacacatgg 53340 

aatatatata tgcattttta agttacttga aaatatattt aaatacataa ataatttgtg 53400 

40 tattacacac aaatatgtta tttgaaaata catttaatta tataaataat tgtatattta 53460 

aatacattta aatatataaa taattgtata tataaatatt tgtatattac atataaaggt 5352 0 

atgtttttaa aaacataata tatgcaaagg gagagataag tgaaggagat agaggaatat 53580 

actaatggag aaaggaaaag gagatgctgc ttactagctt ctagcttttg ttttccagca 5364 0 

atccaaagaa taatatctca ctagagattt tatatgtgct tcagaaagat gtatagaaag 53700 

50 agaagagttc ctagtgtatg gggtacgaga ggtatcatgg aatggaaggg ataacatgat 53760 

aattaagaga agcacagatg tccttaggga atagaagatt ttctggtcct gaaggagagg 53 82 0 

agttggaaag tgggtaacga aaactgagat gcagatggga cctgggaaac agggctagct 53880 

55 

aggaaacaaa agcacaaata attgttttgg gtccaaagag ttaaggcctc cttattctct 5394 0 

aaggagagat ccttggaact agaaaggcca gagagacaga cttgctcatg gacctgcagg 54000 

60 aataagagat attttatgga taatgtgcac agggagttgt acgtatactt agaggaactc 54060 
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caaaatttct 


ctccaacccc 


ttgttgtcaa 


acataagcat 


ctgtttttga 


gctctgacac 


54120 




agaaacttgg 


gataactaga 


tgacctctca 


gcacaatagg 


ctctagcaat 


aggaaattga 


54180 


5 


cttgagtgtc 


atcagcaaga 


cggtggaata 


ggaagcctca 


ggccctcatt 


tccccatgga 


54240 




gagactgact 


tgacaacaat 


gtatgaacta 


caatgccttt 


gtgagaactc 


tagaaactag 


54300 


10 


ttaagaggtt 


gcagcacctt 


aggtgaatgc 


aaagccagga 


aaagatgcat 


caaaatgagg 


54360 


agaaattgtt 


gtcgcatttt 


actcattcca 


gccactcttt 


caccctggca 


caatgcgact 


54420 




tgatctacag 


aaaacattaa 


agttccagtg 


gaacaggaat 


taaaagaaat 


taaaaactgt 


54480 


15 


gtaagcaaaa 


actcagtttt 


atgtaaaaaa 


aaacccccaa 


tttcccttaa 


agaagagaaa 


54540 




gggctggagt 


cctttaaaat 


taactgccta 


tttttctttc 


tgaggctagt 


gagccttatc 


54600 


20 


tctgtctttc 


ccaggcatgg 


tgaagcattg 


tttttctagc 


tgtgcagctg 


caagatcact 


54660 


agacagataa 


tctcaagtca 


taaaacatgt 


tgtttcttga 


aaaggaagaa 


atgatgtaat 


54720 




gcatgtctta 


attaaataac 


tgtctttgtt 


tctcacttct 


gtaatatgct 


tcctcctgca 


54780 


25 


cagatctccc 


cccaccccac 


aaaatgctta 


aaaggtaacc 


acattctttg 


ttcagggctc 


54840 




attcctttgg 


atgttaatct 


gactggcttg 


gtgcacctaa 


aaaattaaat 


aattcctccc 


54900 


30 


caacctctct 


gtctctctga 


ttacttaatt 


atcccactgc 


accagattct 


cctttaaaga 


54960 


ggcaagagga 


ctagaccatg 


agtccagcat 


tctgggtttt 


gtgggggtta 


cccaaggtac 


55020 




tggtttctgt 


cttgcctgac 


tcagagtgca 


gagttgaata 


gcatgttgtg 


agcctcggag 


55080 


35 


aacaaaggca 


ggcacactat 


ttcaccacta 


gggagattgt 


agtactgcag 


gcagacacca 


55140 




gggggagaaa 


gaagaagaag 


caggcaaaac 


tcttttacta 


ggaaattaca 


catacatgtc 


55200 


40 


ccaaagaagt 


ttgaagggcc 


cccagaatct 


ctagctatgc 


taacccgtaa 


aggtcttcca 


55260 


ctacataaag 


tcagtttgta 


atgactggaa 


aaggttggtg 


ttgttttcaa 


atgcacaaat 


55320 




ctcaataaaa 


gatcacaagg 


catacagaga 


agcaagaaaa 


catggcccaa 


ctaaaggaat 


55380 


45 


acaataaatc 


tccaaaaact 


aactttaaag 


aaataagtat 


ctatgagtta 


catgatcttg 


55440 




taaagatatt 


tgtacacata 


tttattgcag 


cattattaat 


aatagccaag 


aggtggaagc 


55500 


50 


aacccaaatg 


tacattaatg 


gatgaataaa 


gaaaatgtgg 


tatatacata 


caatgaaaca 


55560 


ttaatcaacc 


ttacagaaga 


aggaaatcct 


atcatatgct 


acaacaagaa 


tgagacttgg 


55620 




agatctcatg 


ctaagtaaaa 


gaagccatca 


caaaaaaaaa 


aaagaagcca 


tcagaaaaag 


55680 


55 


acaaatactc 


tgtgattcct 


ctcatatgag 


gtatccaaag 


tagacacata 


gcaggacaag 


55740 




ctgtagacag 


aacccctcag 


acaccaagtt 


aaagaaggaa 


gggct ttatt 


tqqccqqqaq 


55800 


60 


cttcagcaag 


actcacgtct 


ccaaaaacca 


agttcccaga 


gtgagcaatt 


cctgtccctt 


55860 


ttaagggctt 


acaactttaa 


gtgggtccat 


gtgagagggt 


cgtgatcaat 


tgagcaagca 


55920 
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ggggttatgt gactgggggc tgcatgcacc agtaatcaga atggaacagg acagggatat 55 98 0 

tcacagtgct tttccataca atgtctggga tccatagata acataaccag ttagctcagg 56040 

5 

ggtcgatctt taaccaggcc cagggtgtgg tgctgggctg tctgcctgtg gatttcattt 56100 

ccgcctttta gtttttactt cttctttctt tggaggcaga aatgggcata agacaatatg 56160 

10 aggggtggtc tcctccttta gtcccccgct ttgagaatct cactcattag tgggagttct 56220 

cactttcatt ttcactaccc atgtcttctt gcaagacaga tcaatagtga ttcatatagt 56280 

acatttgtgc tgaagcattt tggtgaacta aggtagcgat gaagcttttt atcatttgaa 56340 

15 

gaagtacagg tagcaaacaa gagagcagta agcaggtttc tattactatt attactccta 56400 

ttataagagt tttaaatcct cctagcactg ggaaccattt tccaaacatg gccccaggat 56460 

20 catatccatg ccacacttgc acgggcacat gtgccagttt tgtcatattt ctaactatgt 56520 

cttcaactac ttgcccttgg tcatctatgt gtagatagca attagtaagg ttaaatgtcc 56580 

tacagacccc tccttcagct gccagcaagt agttgggagc caatctattt tgatagatag 56640 

25 

catttctcat ctgagtttct tgctgggcca gaatagtcaa ggctctgctg gtcttattag 56700 

tgattatttc taagacagct tgtaaccata tgctttggtt gagcatgtaa atgggggtcc 56760 

30 agtatcccca tgagctgtct tttgcccaag tagcaggcca ataatattgt atgattcttt 56820 

caggggacca tttattatct ttccaatttc ttatggctat gtttctcttt tcatgggaag 56880 

catagacagg gaagcccagg agttcacctg tctttatggg cagtaggaag aaagatgatt 56940 

35 

taatagtgcc aataacacaa ctactttccc actggtcagg taatttggca taagttctat 57000 

gcccacatat ccagtataat ccagtggggg ctgtccagtc ctggtggtac tctgggtggg 57060 

40 ttcacatggt ttacaacttt gggaatttac taaatggatt tttcttagtg tggtttgaac 5712 0 

gccacggtgg ctgtttttgt agtactatta tacagttttt gcccaaggca gctgggtctt 57180 

cccacaggaa gggtgaagtc cttccccact cttgctatac agtattgtct aatgattgag 57240 

45 

gctttcaaga cccagaagtt atcagggtga ttctcttgag ccaggaattt atcaggaact 57300 

gggtctgtag gtactaattc tcaggcttcc catggccatt gatctcccag tatagttcct 57360 

50 ccgcataaat aacatgaagt gacattgaga gactgggcta catgcttggc taattgcaaa 5742 0 

aacaaatttc ttgtttttcc tggaatttct ggcactggca cattcagttc atcatagaag 57480 

tttgaaatac tggctcagga gagggtttat aaacttcttc tcaaactatg atatttactc 57540 

55 

aaggatccag tccagcccca tcgatttcta gggttacatg ctcccctttt ttccagggag 57600 

gatcaagggg gttggttatt accagttcta aggggttaca cggacccctg gtataggaag 57660 

60 ggccactttt ccttttctga aggtggacag gatttttttt ttcatttttt atccaagtag 57720 
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cctaaatgac acaagaccag tatccacatt 
tgtacttatt ttttgtcata tagcctcttt 
5 acttattact attaatgaca gcacaggcat 

ctttttcttc tgttttggct aacactttaa 
cagtccttaa tcttatttca aaaactgtgg 

10 

cataacacag gtcatttcct gggttacata 
ttattttcag agttccagta cacttataat 
15 tttgtcctac ctcagtgact tggtgtatac 

cagttgaagt ccttactgta caagttcaaa 
ttcttatgct ttggccatgt gtggaccagt 

20 

gttgtcctct tcagagtcac tttgcagggg 
tcacagtcta ctgatgttca aggatggtct 
25 ccaatacctc tacatattta tgttcagctg 

ggtttagggt gttgcaaact tcaatggtta 
cttggttaat ctagcatttg ttagccaatg 

30 

agcatggaag gcctttatat tcaggttttg 
taacagggct gtcactgcca gggcccctta 
35 ctagttgttt tgagagatag gccactgtcc 

ctccaactgc cattttttct ctttctgaca 
gtagccccag ggctgaggcc aacatgagtt 

40 

gttggttgta atagatgtag ttcatctaat 
atgttgactt taatcttgca gctatttgat 
45 atgggaatcc aattgcatct aaatagacgt 

tcgctttaca atgtcttatt tttcctccct 
tagccaattg gactgaagta caagtgccac 

50 

gtccaccaca ataccaccac acatctgctc 
gctcttgaaa attcttaagt tcactacatc 
55 cttcctattg tgagagacac gaaatgaact 

tcgggggctg aacttcaggg tgccagactt 
ttattactcc aggctgtaga atcctggaaa 

60 

aggaccacct tagtggaaag gggactgtct 
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cattttcaca 


cagtcctaat 


tcatgacaaa 


57780 


tttaattaag 


agaaccatat 


cctatttcta 


57840 


caaatttcaa 


ggtgactttt 


ttgg'gcaccc 


57900 


tcatgtcatt 


tatgagcccc 


cacgagtcct 


57960 


tcatgggaag 


ctcagatggg 


tcataacaca 


58020 


ccttgtatag 


aataacatta 


tacaaacaag 


58080 


aaccataaaa 


taataggacc 


atagcaactt 


58140 


actgggaaca 


gtcctcagtc 


tgaggaaggt 


58200 


ttttaaggaa 


aatgagtccc 


atgatgagtt 


58260 


cagcttctgg 


atgtgactgg 


agcagggctt 


58320 


ttggtgaaac 


tgctactgtc 


cacatacagc 


58380 


tggaggttgg 


gcctgctaga 


ataaactagt 


58440 


agctctctga 


taccgggaac 


aaggtggcag 


58500 


tgggggattt 


tcacatagca 


agctttgata 


58560 


atgtcctttg 


gtattcatca 


aagttaccac 


58620 


cctaagcgtt 


agtttatctg 


cttcttgtgc 


58680 


gacatggggg 


ccagcctttg 


gaaactttgt 


58740 


ttggccaggg 


cccgacagtc 


tgggctaaaa 


58800 


catagagtgt 


aaagggtttt 


gtcaggtcag 


58860 


tttcttttaa 


ctcatgaaaa 


gttcattgct 


58920 


ctacatattt 


atttacagtt 


atctactaaa 


58980 


ttcaagcttt 


aaagtgatct 


ggtattcccc 


59040 


gagagtcaaa 


agacccataa 


ggggcttctc 


59100 


ctggctgatg 


aaatgccagg 


gtgaaaggga 


59160 


tccagttatt 


cagcagagtg 


cccagtaaag 


59220 


tgggatgaat 


aagggctgac 


tgattgataa 


59280 


ccttcaggtt 


tccaaggaac 


gctaagtttc 


59340 


tagtgttggg 


agatggcagc 


tggatggccc 


59400 


tqqqatatag 

Zj zj zj ~j 


cagagagagc 


ttggcatgac 


59460 


agagatacca 


tgcagcccac 


ggtccactgg 


59520 


gggcctctgg 


cctgccacgt 


gcacaagcat 


59580 
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aacaattgct tttgtttaac gtgcaaatgg 
catcttggta tcttgtctta actgccaaag 

5 

tctagtaaaa tgaatgtatg attttaggaa 
ttgctcttta gtggtcaaca gaacgttgga 
10 gggcaagact cctggttgac actggagtct 

taatttacta atgcccagtc tgaggagagt 
gagagctgtc tttgacttgg caagtcccca 

15 

atagtttgag gcaaaattga cttggttatg 
ggaaagaaga aagagtaata gaatagataa 
20 gtagggtttt cccctgggac tatggcccac 
ggtgtgatga gtccatcctt tttccactgt 
agggtagggt ccttcctagg ctggctcgaa 

25 

aatgtgatct tcagactggt gctggcttac 
aagactttta gttttgaggg aaaggaaagt 
30 aaattgacct tttgttttaa atgtggggat 

ctttctactg agaaatttcc tttagcacct 
caaacagctt tttatatttg acgcttcctg 

35 

cacccttata ttagtgtgtt attaatgtta 
atttatttaa tttttaatgt tagatcataa 
40 tttataatct ttgttaaaga gcaggttaat 

ttaatgttca gttcacagaa aaactggatg 
cacacagaat ttcctttaca attgacgttt 

45 

ttaaaaatct tttaatgtag gtaaaaatcc 
ccagaggtgt attttacttt tcttatacac 
50 tacattcagg aggcctagtt acttttaaat 

ttataactct ttttctttca tgacttttgt 
gacttattac aaacatttct ttctttaaac 

55 

tactatataa gattcttttt acataaattc 
gggataacca ttcttttcca aagcaaactt 
60 gctacaagat tagaagttac tattatacat 
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aatatttgat 


ccattttaac 


caggcatttg 


59640 


tttgttttaa 


gtctttaact 


tctatgatcc 


59700 


attacaaaaa 


ccagttgggt 


cagtccatcc 


59760 


ccaactatgg 


cataaaagtt 


ctacattggg 


59820 


ttaccaaagt 


ttcccccaat 


caaatagtcc 


59880 


caggagggac 


agaggtactt 


ttctgaagta 


59940 


cagggtataa 


taaggcaagc 


attaaatgca 


60000 


ttaataacta 


gatggtcagc 


aatagagtga 


60060 


aagagttaaa 


tttttctcag 


ctttagtttg 


60120 


gactctggag 


cgggtggagc 


tttcttgact 


60180 


atgaacagca 


gtcttggtgg 


ttagcagcac 


60240 


tttcctcttt 


ttttttcact 


ttttgatgag 


60300 


cagaaattgt 


aggggtggca 


catgtgttaa 


60360 


gggagataaa 


ccaagtatat 


aatttttaag 


60420 


gtcagcagtg 


gactttatag 


tccttggtgc 


60480 


attttttatt 


agtttttaga 


ccaaagaagc 


60540 


tatgattttt 


ataccagata 


agctaaatgt 


60600 


aacttagttt 


taataaaact 


tagtaggcat 


60660 


ggtaagattt 


ttatagactc 


tttttaacct 


60720 


gctttaagag 


aaacccattg 


tgtttttatt 


60780 


ataccccttt 


aactttagcc 


aatatgttta 


60840 


taaaacttgc 


ttaaaccttc 


aaaacaattt 


60900 


acattcttat 


gcctccttat 


aatgctttta 


60960 


cttgcacata 


aactgttttc 


tcagtagttt 


61020 


tatacaacat 


ttcttgcatc 


aattcttttt 


61080 


agacaattct 


tcaacatgcc 


tcaactttct 


61140 


aaccagttaa 


tttatttcag 


gacaagaatt 


61200 


tgcccccgct 


tttttccccc 


tttttttggg 


61260 


cctttatgtc 


tgtggattag 


actgtctaag 


61320 


gttatactgt 


taacttttag 


caaattttac 


61380 
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ttttgttgaa aaccttgtac gtttgggatt 
gtttagtcca aattaactta gaattggtat 
5 gggaggaacc gtctatcatc cagtcctgaa 

tgtatggcaa ttaagattta gatcccctgt 
agtggttaat gttaaatcat cttttttttt 

10 

tgtgctcaca atgaggtttc ctctaaaagt 
agttgccgct acagagtgaa tgcatttgtg 
15 tgataggaag tctacaggtt gtcagtggcc 

acttacagca aggtggtatt ggagtgttgt 
ttataggttt taaatttacc ctgtctttta 

20 

ctacttgtat atctctttct ttctgtcttt 
ttttgcctct gtatctttct ctctctctct 
25 ttgactccct ctttgtctct ctgtctcttc 
ctctgccagc tgcttatgct gctgttctct 
cagctgtacc caagtgtcta tatacgggaa 

30 

cttgtgccac acctttgaaa caagggacct 
ctccaatact ggccagtcta tttcacacaa 
35 actgtaatct cccttaaatc ctttcttgaa 

tttgtgcctg gcccatgctt cctgaagaca 
tgggtaaaaa gggcacacac acacttttgc 

40 

atcagagtgt caggaaatcc aagccgggtc 
agtcaagtca aaatcaaaaa ccaaagtgct 
45 tgcttccact taaatggagt gggcaagttc 

ccggactcca agtgccagtt ccttcatggt 
gcctgctgtg cactgctctg atgaggcatt 

50 

tctcaggatc tgcttcactc aagctggctg 
ggcctaagcc acctaaggag ctgtctcgac 
55 ggaaccaaga aatgtagcag gacaagccac 

aaggaagggc tttattcggc caggagcttt 
ccccaagtga gcaattcctg tcccttttaa 

60 

gagggtcttg atcaattgag caagcagggg 
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tcaattatcc 


tttgctatta 


gtaagacttt 


61440 


agacggcttt 


ttcttttcct 


tccattacct 


61500 


gggagttcct 


cctaggtctg 


gtcagacctt 


61560 


taggaaacct 


gctgggttaa 


gggaattttt 


61620 


ttcccttagg 


atacttctga 


actggtgagg 


61680 


tatttttcta 


ctttcttctg 


ttagcaaagt 


61740 


ccatccgcag 


attactaggt 


taaagatttt 


61800 


tcagtgcttt 


cgggctacgc 


acttgtttac 


61860 


agggtcacag 


agaagacctt 


caattatcaa 


61920 


aaggaatagg 


gtacactgtt 


tttttcttaa 


61980 


gactttctgt 


gtctttctct 


ttgactttcc 


62040 


ctctgcctct 


ttctttctct 


ctctctctcc 


62100 


ctctctctct 


ctctgctggt 


ctttccttgc 


62160 


caaccactgt 


gagtgggtgg 


ggtctaaaac 


62220 


ctggtctggg 


tgccctggct 


tacaagttac 


62280 


gtccaggctt 


ccttctgatg 


gccaacccac 


62340 


agttctagtt 


tttcctggtg 


tcatagtaac 


62400 


atttttcaac 


atagctccta 


gtggtcttac 


62460 


aaacaccatg 


ctcacactgc 


aaaacaaaaa 


62520 


agtttacacc 


aaaccagaat 


caaaaccaaa 


62580 


aaaaccaaaa 


ccaaagtatc 


aagcaatcca 


62640 


ggtacaggca 


caccatgggt 


gatcaggcca 


62700 


caaagaccag 


tcttaccaag 


ttttagatgt 


62760 


gttcagccac 


tgcattgatc 


ctccacaggg 


62820 


ccactggggc 


aattgcctac 


ctgggagtgc 


62880 


gagtccccct 


caggtatgct 


ccacagggca 


62940 


catccattaa 


tcaccttgtt 


tcctggtcag 


63000 


agacaaaacc 


cctcagacac 


cgagttaaag 


63060 


ggcaagactc 


acatctctaa 


aaaccaagct 


63120 


gggcctacaa 


ctctaagggg 


gtccccatga 


63180 


gtatgtgact 


gggggctgca 


tgcaccagta 


63240 
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atcagaacag aacagaacag gaaagggatt 
atctatagat aacataaccg gttaggtcag 

5 

gcacagggct gtctgcctgt agatttcatt 
ttgggggcag aaactgtgca taagacaata 
10 gcaaacatgg aacagaaagt aggatggtgg 

ggtggttgtt caatgggtat agagtttcag 
gttacacaac aatgtgaaca aaattaatac 

15 

ataatgaatt atatgtgttt tgtaccacaa 
ttgatttaaa gaaaatattg ttctgtttct 
20 aaaccctttt gctcctaggt cagtagctct 

ctccatcagg agacatttga caatatcaga 
gattgtgctg gcataggcca gggagtctgc 

25 

ctactgtaaa gaattaccca gtccaaagtg 
cagatcatta atcagaacgt gttattgata 
30 tagaattcag actctggaaa aaactgggaa 
gacaagagaa aggagaattg ttttaagctt 
gggaggattg gggggtgagg aagacagtca 

35 

ggttctgtgc taggaacctt acacgtgttg 
ttttaaatta aatagcacag agaggaaaac 
40 gcaactttaa ctggtttatt agcacattcc 

gaagagacca aagagactgg gctccttttt 
actgtttcac agatgctgtg agggtgacaa 

45 

ggcactctgc gaaagaggaa ggggcgccag 
ctgcccctct ccccaggctt tctgtgtgag 
50 acttaagaca agccaattac atttcataat 

ccagccttct aaatggtgct tcttaactcc 
ttaagtgcag aatccatggc cctctgtggt 

55 

ttcattctgg aagctccccc aaggaaagga 
attgccctga ctggctctgg tttccttgcc 
60 ccacccaagg ttttgaaggg gcacagtgcc 



ttcacagtgc ttttccatac aatgtctgga 63300 

gggtcaatct ttaacgaggc ccaaggtgca 63360 

tctgcctttt agtttttact tcttctttct 63420 

tgaggggtgg tctcctccct tagagacaga 63 48 0 

ttactagggg ctaggaatag gaaggaaagg 63 54 0 

ttctgcaaga tgaaaaagtt ctagaggtct 63600 

tagtggtctg tacactaaaa agtggttaaa 63660 

tttaaaattt taaaaataaa agttgagaaa 63720 

agcacacctg cctttacaca gaacacacaa 63780 

caacatgaat gtgaggggat tttgttctcc 63840 

tactgtttct ggttgttaca catagggtgg 63900 

taagcatcct acagtgcaca ggatagccac 63 960 

gcattagtag cttttttgag aaacctgtcc 64020 

taccagtctt ctcccttgta gttaggttgg 64080 

tgtaaaaggc tgaatgagtg taagctgaaa 6414 0 

actaaagcca aaatggaaaa aaaaagaggg 64200 

acagaccgct tttaggccat cattccgaag 64260 

tctcacataa tccttaaaac tcagaaatga 6432 0 

tgactggaga gggagccttt gcttaggtag 64380 

cctgaaggga aagggggtgt gattgctcct 6444 0 

aatcaaagct caggaggaga gctgcattcc 64500 

agatgcaggg cacccactgg aaacacagac 64560 

gagcttgggt gagcaaggtt ggaggtgatt 6462 0 

tccattcctc ctctcagatt tattgttaga 64680 

gtctgtgtca ttcagacgct gagaactaat 64740 

tttcttcaac aggagagtta atgtgtggaa 64800 

gacggtgatg gttcaggttg tgtttctggg 64860 

ggaaggaagc ttgcagggtg gggctttgcc 64920 

tgatacattg aagtcagctt ccaaagaatg 64 98 0 

tctgtcctca caaaaccagc ctgcatatga 65040 
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gtttactgaa 
tatttcaatg 
gaaaaagagc 
attgaaaggt 
aacaaacaaa 
attactatta 
tattattatt 
ctagcactat 
ttataaatat 
taacagatta 
gtctcgaagt 
ttggctcttt 
aaatctacaa 
ctgatgactt 
ttaaaacctc 
gctggaagac 
aacttgaaac 
aaatggaagg 
gtgagtcact 
tgtttagacc 
tccaagctga 
gaggaccaac 
tgcagctgtt 
ttaggaagag 
agatccaaga 
actgcaaata 
ttgcattcat 
ttactaacat 
tgtctggtaa 
ataagtaagc 
gttgtcagta 



ttcatttagg 
cttccctgga 
agccagagga 
tgctttcagc 
caaactatat 
gctatatatt 
atggttagtt 
tactgttatt 
ttttgctcat 
gaaaactgaa 
ggcaattcac 
tactgacctt 
tctgtgagta 
tcagaagatg 
tttggagctt 
ttttctcatc 
tccttgacag 
agtatctgtt 
tacctaaggt 
atttgtaaac 
gcttagcaac 
gacgcgagcc 
tctccagcct 
gtagaatgat 
ctggttttac 
tttcttcagc 
aacttaaact 
gcagagctgt 
aatatgataa 
agagttcatt 
ggactagagg 



tcctgctgaa 
ggggcccaac 
ggaagatgac 
ctgccaagac 
atataaaaaa 
aatgaatgtt 
atagacagtg 
agttacatgt 
ctttatgaca 
gcttcaagaa 
acataaggct 
gctagaagtt 
tcacatcctg 
aaggtaagta 
ctgaggactc 
tcactattcc 
cagttaagag 
tttaactctg 
tgtgtaatta 
tgggcggtct 
aaataccaca 
tatgtgttat 
gagtaaactc 
ctagagattt 
agcttcccgg 
attaactctg 
ttattttgca 
tgtaaggatt 
acatatctga 
tattagtctg 
aaagggaggt 



tctttcccat 
ctttctggag 
ttgagggtag 
ttggggaatt 
caatttttta 
cctttaaagc 
gaatattgtt 
tattgaaagc 
attctccagt 
cagacttgcc 
ccatgactcc 
tacagncacg 
tatagctgta 
gaaaccgttg 
agctggaacc 
cttgccgtga 
acactagggc 
tgtccgctgt 
atgggagcag- 
gtgctttgag 
acctacttgg 
gagtccaaga 
agacacaggc 
atgcaaaggc 
ctgtgtgaac 
ggccacacat 
ttttcttatt 
aaaagcactg 
gtgctaataa 
aaaattcatc 
tgttgcafat 



ttgttccttc 
ggcaaagatc 
gcagcagttg 
agaaaagcaa 
aaagtttttt 
agtgtagtca 
ctttctctat 
ttcaaagcag 
gttggtattg 
taacaacagg 
tgaactctca 
gaagtgcagg 
aacactggaa 
atgggactga 
aacgggcaca 
cctcattaga 
cttttggaaa 
attctaattg 
agctcttgtc 
aatggtgccg 
tctttatgct 
cttgttctca 
tggcaatgcc 
ttcagactcc 
cagggtgaac 
atatgcaata 
tattacatgt 
cagggtaaaa 
atagtattat 
aaatatatgg 
tgcaaatttt 
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tctgtttcaa 65100 

tgttgttaga 65160 

ggcaatcagt 6522 0 

gataaaacaa 65280 

tatgttctat 65340 

ttattgctat 65400 

attatgatta 65460 

cataggcttt 6552 0 

ctcctctatt 65580 

aaacttgtat 65640 

caaatattag 6570 0 

aacatttcac 6576 0 

t aaggaaggg 6582 0 

gaaaccagag 6588 0 

ggtaggtaat 6594 0 

ggagtaattt 66000 

tagagtgggc 6606 0 

cataaactca 66120 

tcttcaaatg 6618 0 

tgaaaaggga 6624 0 

aaaaagtggg 663 0 0 

taccactgca 66360 

aggtgggaaa 6642 0 

aggaaagctg 66480 

aataagggta 6654 0 

ggtatattat 666 0 0 

ttttactaac 6666 0 

attaagaatg 6672 0 

taaacaagaa 66780 

caatacagtt 6684 0 

gtgtgaccat 66900 
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ttgcctctgg acattgagac ctctcaagat 
ttatctataa tggactgtgt caaacattat 

5 

ggttttcact gttccatact atcagtttct 
attggcaaag actgagttac tgtttttgga 
10 gagctaaatc tattttttct tacatagttg 

ccaatgagac catcatagtg ctcccatcaa 
ccgaacccac caaccagggg caggatagcc 

15 

ttattggggt aaatctaatt cagaacgtgt 
tgatatatgt cttgggactg gacatctgtc 
20 agggacttta gggtagaagc cacttggaga 

tgtcttaaaa atatattttc ccttattctg 
tttgtatttt tacaatacag tgcttttcta 

25 

ataatgacat attgtgtgta tgtgtatata 
tgactttttg caaataattt ttccctcact 
30 cctcttttct cctaacgttt ttgagaatgg 

ctcctctcac ccagaattta ttaaagaagc 
ggatctatct tggattggga gggaaacaca 

35 

ttatgtgatc tgaggtgagg cactctgttt 
ttccttgcct catgtacaat cacatgggtt 
40 aaaacttttt ggtaaatgat aatagagctg 

aactacctcc taagacagag cttcagttat 
ccctgtgtta gtctgctaag ggggatggcc 

45 

aacagaacaa tgcacactga ggcctttcag 
aggaaaaata actaatggat actaggctta 
50 aaacccccat gacacaagtt tatctatgta 

aataaaagct aaaaataaaa gagcttccag 
ttccgaggaa gtgaccaaag gggcaaactc 

55 

agggccttct gcactgccaa tactgcagag 
tcagagagat agaaagagag ttaactagtc 
60 gtaaatctca gcacttgctg tgagctggaa 
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tcatgttgct 


gccaccctgt 


gatgtgttta 


66960 


ggaaaaatcc 


tcatgaaacc 


aagattgtgt 


67020 


ttctctaatg 


gacaggccaa 


gcctagacta 


67080 


gtcagaacct 


gggagctctg 


tactttttaa 


67140 


gcaacaccat 


catgacatca 


caacctgttc 


67200 


atgtcatcaa 


cttctcccaa 


gcagagaaac 


67260 


tgaagaaaca 


tctacacgca 


gaaattaaag 


67320 


tggagagggg 


ttgggggaag 


tgccaagaga 


67380 


gtgagtgtgg 


agaccctaaa 


attttgctag 


67440 


aaactgtccc 


agaacttttc 


cacaagaggt 


67500 


aatatgagga 


attgattttc 


tattgttgtc 


67560 


tattttcttt 


tacaaaagta 


atggtggtaa 


67620 


ttccacaact 


gcatgcttaa 


taatccaaac 


67680 


gcaaattaga 


ggaaatatac 


aactcctttt 


67740 


aaatgatatt 


caccttttgt 


ttgtctgttt 


67800 


tgtattgcat 


gaagggagga 


gagaaagaag 


67860 


gagttttggt 


ttctagaatt 


gcctctgcac 


67920 


gactggattt 


caggacctgg 


tgggactggt 


67980 


tctatgaggc 


tcatgtgaca 


taaggtctag 


68040 


tgaatctctg 


aggtggatgt 


caggaaggaa 


68100 


acatgagaac 


atgcagtatt 


tggtttctgt 


68160 


tctaaatgat 


gagaacacat 


ggacacgtag 


68220 


agggtggagg 


gtgggaggag 


agagaggatc 


68280 


atacctgggt 


gatgaaataa 


tctgtacagc 


68340 


acagacatgc 


agttgtaccc 


ttgaacttaa 


68400 


agcttcagct 


atagggttgt 


gaactgcagt 


68460 


catgtccagt 


ggagtctggg 


agatgagggg 


68520 


aatgtggggg 


ctttcagatg 


tcccctgttt 


68580 


atggacttag 


aggtcctgag 


agctgggaaa 


68640 


gcaagtctca 


tgagcagacc 


agggcctgag 


68700 
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aggcagaaat gcagaagcct tcctgagcac 
ggcagagctg cacaccctcc aggatgtctt 
5 ggtttgtatc ctcattagga tcttgcagaa 

caaaaaggcc ctgtctttac aaatgctggt 
cagactatcc agatcttgtg tggcatgatg 

10 

gcttccttct ctccaaattt tacccaagtg 
ttcataggac ccttttttgt gagtagagtt 
15 gaggaagatg ccaatagctt agacttccca 

gagcaaggag tcaacggtga atcttgttct 
attttagatc aatttcttat cagactcaaa 

20 

ttcacattat catcttgagc tatgatgaag 
aaaaaatcca tgaattagga taaagttggg 
25 gcataaatgg aagacatcac acactcacag 
gtggggatca tggaggaagg agaacttaca 
cattgactga ttcccaggta aggctgagca 

30 

tcctctgaaa tagctgcaag tctggagttt 
aactgctttc tctaagatgg gggctttctc 
35 ttccttcaac ttcctgcaat tcagtagcta 

caccaagaag tccacctttt tgtgtatctt 
tacacttgga tttggatttc attggttttc 

40 

atgccaagct actacattac actacattac 
taggctgtct aagattttcc ttttccttga 
45 gagattttta tagaatgttc ctaaatttgg 

agtatggtga tgaatttttg ggtggtgatg 
tatcaagggt catgctacca acataatgtc 

50 

ggggtagtat tttccagatt tctcctctgt 
ttttatgtcc ttccctagag aacatcttac 
55 aaaaattatt tcctggcttt taaggtcttt 

aggatatatt tctgctatct tttttttttc 
ggacaaatct atgtatttaa atggaactta 

60 

tcccatgaga gttgtctata aacattgtga 
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ctgccagaca 


cctgagactg 


tggcttctgt 


68760 


ttcatatttc 


tcactccaga 


gtttccacag 


68820 


ggcgataatg 


cattagtgcc 


atgggagcgg 


68880 


tttcatcctc 


aatctcactt 


tctcttccca 


68940 


gtattgagct 


tggggatcat 


tttggcatct 


69000 


acttctacac 


tgttgaactc 


tgcttaccca 


69060 


tctgaggagg 


gcaggatggg 


gcaaagaggg 


69120 


cctgccagct 


tgctatgttt 


gatctgccag 


69180 


cctgtcgggt 


aggatgacag 


gggttgcttg 


69240 


taaacatttc 


ttttgaagat 


catcttattc 


69300 


ctagtgactt 


ctctccaggt 


ttaggcgaaa 


69360 


aaggaacatt 


ttatacaaaa 


aaaaaaaaga 


69420 


agaaaccaca 


ctgttatccc 


ccccatgctt 


69480 


cctctgagca 


ttttgaaaat 


acttttactg 


69540 


gatatctatg 


cccagtgatt 


gacaggaaat 


69600 


caaaaggact 


tagtagtcct 


ttcctcagag 


69660 


ataggtaagt 


ttccctttca 


tagatgtttt 


69720 


tttgtttttc 


ttacatttgt 


gtatttgatg 


69780 


gtccattttt 


cttgtcttat 


taactaaagt 


69840 


cccaaatgac 


ctttttctgt 


tccaagatcc 


69900 


attttgtcat 


gatgtctcct 


cagggtcctc 


69960 


gaactttgat 


agttttgggt 


agcgctggct 


70020 


gtttgtctga 


tttttttttt 


tgatgtttag 


70080 


atgaggtaaa 


gcaccatttt 


catcaagtcg 


70140 


actgacacaa 


ttaaccttga 


ttatgtggtt 


70200 


aaaattacta 


cttttttttt 


taaccattgc 


70260 


accaaaactg 


tctcatttta 


ttaaaaaaac 


70320 


ttttgtttag 


tgggatattt 


tttgtttatt 


70380 


tgatgttatc 


tgaatgttat 


atggacaaag 


70440 


cctcattgat 


cactggtttc 


tcagggattt 


70500 


tagtgacgtc 


gaaggtactc 


tgtacatgga 


70560 
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atctgacctg accattttct cttttatctt agtttatcat ctctggctct ctatcaatcg 70620 

ccacagagaa aaggttaacc aagcttttgg tgagtaagag agctataacc cacaacccaa 70680 

5 

tggatcaaaa taaaacttag acagaactct ttaaggctag tctttctact ttgaaaagtt 70740 

gaaaactcag ttccagagca gttggttgtt gttagaatta ggagctgggc ctggatcctc 70800 

10 tgtatccttg ttcagagaac atttctccaa ttattgtacc ctacatcaaa gtctgcattc 70860 

aggggattat aatattccct ctgcccatgc cgaagaatgt atcacagaga aattgtgcct 70920 

gtttatgagg ttctttcggt gataactggc cttcaaattc aggttttcag tggcaaggaa 70980 

15 

gctgacagtg ttataaagcg gtctattggt tggggtccat tctttaagcc caggtgttac 71040 

aacccttgaa aaaaaaatga gtcaaagtgt tgttcatgtg aggtatccaa agtagacaca 71100 

20 gaggctacta cagtatacta cattacattt aggcctgatg tctcctcagg ttcctttaga 71160 

ctttctcaga ttttcctttt ccttgaggac ttcaatagtt ttgggtagtg ctggctgact 71220 

gtatcctttc atctatctca ccagaagtat aatactttta ttttgtttga gtataaattc 71280 

25 

ttgcacccta aaaagttgtc cttagtcatt tgtattggct aacaaaaaaa caaaacaaca 71340 

acaacaaaaa acaaagcttt acccgtcttt atcccttatt ccagcaaaaa ctagagttgg 71400 

30 aagtggcagg gagacaaagc ctggattata gggagaatct tctcttgtct ttaaagtttc 71460 

attaagtctt cgctcaatcc attatctctc aaggggttga tgttggagat tatgtgagag 71520 

aatgtcatac catgcaattg caccgagact caatttggaa gctctggcta caaaaatctc 71580 

35 

ccaaagccag caaggaagtg agaaacggca tccagaaaaa tgccaattct ttttctcatc 71640 

ggtatttaga gattattatt tggaaccctc agttagactc ggaactacta actagatctg 71700 

40 tgctgtttaa tttaactttc tttgaggatg gaaatgaaat gtaactttct atgatgatag 71760 

aaatgaaatg taactttcta tgatgatgga aatgcactac agctgtactg tacaatatgg 71820 

cagtcacatg tggctatcga tgatgtgaaa tatagtgcaa ctgaataaaa atttctctta 71880 

45 

attttacttt aaatatagat agtctatgtg gctattggca accatactga gtagcctggc 71940 

acaagggtag tgctttcaaa cttttaggag gaaatatatg ggcctgttac tccagagaac 72000 

50 aatcccacag tgaacatgca ggaagctcaa ggatgcagga tgtggtctcg gggtgggctc 72060 

agacttacca gagacatcga ggctttagcg agttctactc ttgtcacaca ttgcaccagc 72120 

cctttttgag attgaagaag agtgccatct caagacaaat ctagcttatt ctaagaaata 72180 

55 

tagactctag gcatcaagaa tactactggc tactaggagg cccaggagtg tacacttcag 72240 

tcattatgtt gggaaattgg ggtcccagag tatcaactgg tgagttccta aaagaagaag 72300 

60 tgcccatgtg ggattaagta acattctact gtatggctgt gaggtgtgcc ctggagccta 72360 
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cctacctgta actatagctc tcagagtact 
ggagtacaaa agaagtcact tctgaggtta 
5 ataaatgaga accagagtgg gaaataaaat 
aaggcaagtc tagtactgtg acttcaatat 
acttcccgct gcctcccaca cattacacca 

10 

cagccacccc atactgcact cagtgtggcc 
gcctgagact cagaaccacc ccctcactca 
15 atgggcaggc gagagtgagc tattaggttg 

aaccacaatt acttttgcac catgctaata 
ttccgccagg ggtttttgag ttgagcgaga 

20 

cactttgggt cacgctctct tggtaatgtt 
ctgtcctggg gaaagaattt aggagagtaa 
25 taaggtgaac cagttcgtat agtcattcat 

atttttgctg gacactggag aggcatacag 
gggacagata agggaataac ttacaattta 

30 

cattagcaat gatccttacc caacatttct 
cattctgagt gctctgtctg ccctggtggg 
35 cttaaatcct gcctcactgc agtgtgagtt 
tgtttcttac ttttatcatg attcacttta 
' tctggctgta agtattttta gatgggatgt 

40 

atttcttgtt attccttctt ttgaaaatct 
gaaagccagg tttatgtaaa tcaaagggga 
45 ctaatagagt ggaattgaaa atgttaaata 
tttgtagagg ttcttaatta tcaaactaat 
tcaaaatgag aatcttggta catggagtgg 

50 

cccaggagat cagaataggc caacaaggaa 
tggtttgaga agacatcaga ttcagtacta 
55 aagataaggg cttcttcttc taatgatcac 

gcaacattat tgttgctttt tgttatagct 
taagaaatat agggagatgg agagtgtaat 

60 

ttggaagaag aaaaataatg aagtttggct 



cttagctcaa 


ttctcttttc 


tgtgtgagtt 


72420 


gtttatctat 


tcccttcact 


tcataaatgt 


72480 


caaggaactg 


attacccaag 


ggaataaacc 


72540 


tctgacctcc 


agttcagacc 


tttttcttgt 


72600 


tccttgggcc 


tgcacctaga 


cctctgccca 


72660 


aatcatatgc 


ctgcccatgt 


tccagggcct 


72720 


tctttgagag 


gtgtgctttt 


gagagaagca 


72780 


gtgcaaacat 


aattgtggtt 


ttatggcaaa 


72840 


catgatctat 


cttttccatg 


tggctgatgc 


72900 


gagaaactgc 


atctggtacg 


ttgccacctt 


72960 


gcaggtaaag 


cctcatgaaa 


gccctgctat 


73020 


cattctctac 


cacaccactg 


aggtcctgct 


73080 


tcagccaaga 


attaccctgg 


agattgagag 


73140 


caataaatga 


tcaatcccag 


acctcctgag 


73200 


atggcaaaga 


tgcataacag 


attcatgagt 


73260 


ctttcaggtg 


catagcagcc 


tggttggaag 


73320 


tttcattatc 


ctgtctgtca 


aacaggccac 


73380 


ggacaaaaat 


aatataccaa 


caagaagtta 


73440 


taccacggac 


tgctatacag 


ccaaagccag 


73500 


tctaatctta 


tgaggtattc 


aaaagccttg 


73560 


ctctattggt 


tctggtttgg 


gcatctgggg 


73620 


ctacagggat 


gagattaagg 


ggattacata 


73680 


aggttgatga 


tgaaaaacta 


aaacttactg 


73740 


ccaattttgg 


aggcataaat 


tatgagttta 


73800 


tgaattaaga 


atggaaacag 


agcagaaaga 


73860 


acagtatgtg 


tgcatggtaa 


ggagttaatt 


73920 


agaaaaagag 


gataggaaat 


atgggatgag 


73980 


aataatgaat 


aatactgatc 


aggtgagaat 


74040 


gaaagaagtg 


tgatatatta 


aatcaggaga 


74100 


tgagaatgat 


tgcaggagaa 


ttctatctct 


74160 


aatagaaata 


atatgtctga 


agttgaaaga 


74220 
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ttaaggggca tgtgcatggt cagcaacaca 
agggacggct gggtcagtgg ctcaccccta 

5 

atggatcacg aggtcaggtg ttcaagatga 
actaaaagta caaaaactag ctgggtgtgg 
10 gaggctgagg cggagaatta cctgagcctg 

tcacgccact gcactccagc ctgggtgaca 
aaggaataaa gaaattgcag aatatttgtt 

15 

agaactcagc tggaagatgg acagatagaa 
ctcattataa agagggcctt gctcccaagg 
20 atgaggctga cagcaattgg cacagattaa 

aaattctgaa gagaaatttc agagaaattt 
acgattttac attggaaata agagagattc 

25 

gtaatggaaa aagttatatc ttctctttaa 
atgtgatcaa attatttccg tttattatca 
30 ggacagaaat gatagaattc attcttgtga 

tttactacct tcactgtttt tttctaagcc 
aaagaaaaaa caacaaagaa acaacaacaa 

35 

tacatagaca gggatgtcaa tactaaattt 
tgatccttac aaatttctga gaaagaaatg 
40 aggagctaga gaaagagaga gacttgtcca 

agactggaac ggagctgctg attctcagtc 
accgagtgaa caatgtcatc agaactatgg 

45 

gaagtagatt agttgattta attactattc 
taaggggttg aattaagaca attgtgaggt 
50 tgtcctttct tcattctcta atggttgagg 

taggcatgaa ggcttcagct gacctaagca 
gaactctctc tctgatgctg atttgcactc 

55 

ctgtgctgcg gtggaaacag gcttactctg 
tcacttaacc ttgcctagtg tatcttatcc 
60 ggtagaagga acaatcagtc agtcatgaga 
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ttgagagaaa 


aacaggcaac 


ataagagtaa 


/42 8 0 


taatacaagc 


attttgggag 


gccctggtgc 


"7 A ~i A r\ 


gcctggccat 


gacggtgaaa 


ccccatctct 


74400 


tggcgggcag 


ccgtaatccc 


agctacttgg 


74460 


ggaggtggcg 


gaggttgcag 


tgagctgaga 


7452 0 


gagcaagact 


ccgtctcaaa 


aaaaaaaaaa 


74 58 0 


gtcctgcaat 


cataaaaaca 


agataagcca 


74 64 0 


aggtctggtc 


aaagtcatgt 


ttgtatagca 


7470 0 


taatggtata 


atgagaaaaa 


gtacaagtaa 


74760 


tacctggcac 


agccaattcc 


accttgggaa 


74 82 0 


ggtagtgata 


aagtataaat 


tcatgatggg 


74 88 0 


tgtcaagttt 


gcttgctgga 


acaatttcca 


'i a n a r\ 

74 94 0 


ctgtgtataa 


ataccacaaa 


ccaaagtttc 


n r n n n 
75 00 0 


gatggccagt 


tctgcacaaa 


gaatgtctca 


75060 


gttggtgttg 


gtgattagca 


actctgacag 


7512 0 


ttacagttac 


acccccaaaa 


aaaagaaaaa 


75180 


aaatcaagaa 


gaaatacaaa 


ggggttcaga 


7524 0 


tacaaaggac 


ttgcacagtc 


aaaacctcct 


nrT Art 

753 0 0 


ttagtatgca 


tgtttgatac 


atgaggaaac 


75360 


ggcttacaca 


gctactaagc 


agtaaaggtg 


7542 0 


ctgggagtta 


ttacgccact 


ctgctttctt 


75480 


atgtctcaaa 


gaagtaagaa 


ataagtgtat 


7554 0 


atctcccttt 


ctcctctcaa 


ggaaatgagt 


7560 0 


gtgggaggag 


cagttttatc 


ttcctgatgg 


75660 


tggtcttcat 


ctctatggtc 


acctctcaac 


7572 0 


acatcaatct 


tttatttctt 


gactttcagg 


7578 0 


tgctggaatt 


ctgcctagct 


gtgctcactg 


7584 0 


acttccctgg 


ggtgagtgtg 


ctggccggct 


75900 


ctgcactgtg 


ttgagtatgt 


caccaagagt 


75960 


tacacatggg 


agggcatttg 


cattgtgatg 


76020 
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gaagacagag aagaaaagca gatggcaatt 
tatgaaaata gttaatcatg agaaatcaac 
5 ggaatgagac tctgggaagt taaatgactg 

tgctgaggac actagaacct ggcttgcctc 
ccactagtct ctttaatagt attgacttgg 

10 

agtgaatgtc ttatgctctg catttgcccc 
aagctatatt aaaaatgtat tgttgaatca 
15 tgaattaaaa agccttgatt tatccaaaat 
gtgataaatt gcaattattt gaagaacctg 
aagaattcta tgtatacagt tggaaaaatg 

20 

aagtctgaag acctaacata ttttgtttct 
acattggtaa ttctggcatg tcctcaaaaa 
25 tgacttctta agaaaaaagg gagaaatatt 
gaaaagttaa ccattataga aaagcaaagc 
aatgaacatt aaaaaaaacc attatttcac 

30 

ctcttgattt gcctgacatt gacttcagca 
agattagcca aatagtctga atttccagaa 
35 tttcccagaa aatgtttccc agaaaactag 

agcaaaggac acaacttttt attgtacccc 
tgtggtgctc agactttttt atacaggtgc 

40 

tagtgcataa aggcaagccc catataatga 
caaaataaaa tttttgagtg catattttgt 
45 aaacacagaa taagtcaaca atgacagaca 

catcaaatat tcaaacaaaa ctaatgagca 
tatgaagcat cgaccagttc atacttattt 

50 

tataaatatt aagtttcagc atgtgtgtac 
tacatgagct ccatttcatt gaaagaactg 
55 gcaggggtca tacccactgt atcagatatc 

tccaggctga aactggactg tcagattctt 
cattatgtct tctccctgtc cattgcaccc 

60 

tttacctact gtggacattc aacttttatt 
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gagtagctga 


taagctgaaa 


attcactgga 


76080 


tgagtcaatc 


ttcctatttt 


gtcagcgaag 


76140 


r 

gcctggcatt 


atgctatgag 


tgtgtgcctt 


76200 


ccttataagc 


agaaacaatt 


tctgccacaa 


76260 


taaagggcat 


ttacacacgt 


aactggatcc 


76320 


tggtgatctt 


aaaattcgtt 


tgccttttta 


76380 


aacccctatg 


gacttatggc 


tttatttaac 


76440 


tgtattatag 


agtgtagaat 


gaatactagg 


76500 


gtgatatgct 


ctacttatct 


tggattagct 


76560 


gcatatatac 


atctatcttg 


aacctgattg 


76620 


tctagagtgt 


acttttcctg 


cctcacagtt 


76680 


tgactcatga 


ctgtggatat 


gaagaactat 


76740 


aatcagaaag 


ttgattctta 


tgataatatg 


76800 


ttgagtttcc 


taaatgtaag 


cttttaaagt 


76860 


tgtcatttaa 


gatatgtgtt 


cattggggat 


76920 


aaagcacggg 


gctgtaaatt 


accatttact 


76980 


aacaaggcag 


aatgatcatt 


cccagaaaca 


77040 


acagcatgat 


cattcaatgg 


atcacagtga 


77100 


ttaattgtca 


acaggagtta 


actgatttgt 


77160 


tagtgtttta 


tcctatgtat 


tttaactcat 


77220 


agtctcaggg 


tatatgaaag 


tagctggctt 


77280 


tttaacactc 


gttttgttgt 


tttttccctt 


77340 


taaacccaaa 


ttggaactcc 


attttctccc 


77400 


aacaaaagct 


aaccaggaat 


actcattttt 


77460 


tcttattgca 


agaaagacca 


attgtttgtg 


77520 


acacacacac 


gtagaaaagt 


atttcaattt 


77580 


gtcatgtttc 


ccttgctttt 


acccactgta 


77640 


ctgatactca 


gcttttctgc 


tgttttcttt 


77700 


tgattttagc 


tttatcttca 


aactctcgtt 


77760 


cccactgaca 


cacaccattt 


cttttattta 


77820 


gacgcttatt 


ggataccaga 


cactatgtca 


77880 
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aataagagaa 


acagagatga 


cagtgaaatt 


agggatatat 


acagggcact 


aactattatt 


77940 


5 


tgaacacaga 


ataaagcata 


ccttcttctg 


cctaaaagaa 


tgtaggggat 


aaacaggtgt 


r*i rt a n a 

78000 


gtgtgtgtgt 


gtgtttgtgt 


gtgtgtatgt 


gtatgtgtga 


gagagagtcc 


ttcttttgaa 


78060 




aatccacatc 


ccccatatca 


atcacatggt 


gccaccttct 


gccacaggag 


aggacaatga 


f-t A ^ A A 

78120 


10 


ctcagagtcc 


acaattggtg 


tattcatcta 


ctgtactcac 


actccagctt 


tgttcctcac 


7818 0 




cccctgccac 


catgtcttca 


ccaccaccac 


agcactctgg 


ataccagaag 


tcttactgat 


78240 


15 


tacccagtca 


aacttctggt 


cttcaaagtt 


ccccccgggt 


ctctcttcag 


ctttgacttg 


783 00 


tttatcagag 


aaggcatcca 


gccctttagg 


caacactcaa 


agccactttt 


tatcctaggc 


78360 




tcaacctcac 


tgccatgtca 


tccactgata 


tcttttcttt 


caagtagcaa 


tgccccttca 


78420 


20 


gcccaagtca 


agtgagaatg 


atgcctacct 


tgagatagaa 


gatgtattga 


gataaccttt 


78480 




gtaaagcagt 


taccaatata 


gctgacacat 


gaaagatgct 


cagtgaactc 


gagttttatt 


78540 


25 


gtgtgtttac 


ttccctttgc 


tttcacttcc 


atagtccaac 


tttccttccc 


cttccacttc 


78600 




caccctctat 


gtctaaaatt 


aacttcaacc 


ccaatacatc 


gctgggattt 


ggaacctgat 


78660 




tatactcatg 


aaatctttca 


tttcttcaag 


aaagtctggg 


ccaggcacag 


tggctcttgc 


7872 0 


30 


ctgtaacccc 


agcactttgg 


gcggttgagg 


cgggcagatt 


gcttgagttc 


aggagtttga 


78780 




gacaagcctg 


gagaccatga 


ccaaacccta 


tctctataaa 


aaaaaaaaaa 


aaaaattagc 


7884 0 


35 


caggtgtggt 


gacacatgcc 


tgtagtccca 


gctactcaag 


aggctgaggc 


aggaaaatca 


78900 


cctgaaactg 


agaggttgag 


gctcattgag 


ccatgatctt 


gccactgcac 


tctagcttgg 


78960 




tggacagagc 


aagaccttga 


aaaaacaaaa 


aaaaaggaaa 


gagaggaaaa 


aaaagagaga 


7902 0 


40 


aagaaagttg 


tagaatgctc 


taaataggat 


gactcagact 


gtgtaggaaa 


catgttttct 


79080 




cttccctttg 


caggagaaag 


attttagaat 


tattggcctt 


tcccaatttc 


tgcacagttg 


7914 0 


45 


actctactga 


caccttatgg 


tgataacgag 


gaacaacttt 


tctcccaaga 


gaaatagaaa 


79200 




aaggcaaaac 


aaggtatctg 


gaattcactg 


gaggtatcta 


acttgaccac 


aggaaatcac 


79260 




acttgctacc 


tttgtccttt 


acagtgtctc 


cacatgtcat 


cagtgagggg 


aaactatgca 


79320 


50 


tttttcaaaa 


gttatttatt 


attgtaagga 


aagtggctgt 


gcttcagtca 


ggagtaggcc 


79380 




aaggtaaaca 


tccggtatgg 


tatgacacag 


cgggtttgga 


gcgcaggtgc 


acaaccccat 


7944 0 


55 


gcattatgta 


accatttact 


ataatctgtt 


tgtgtgagct 


catacctggc 


tttgagccac 


79500 


tctgtctgtg 


agtaatataa 


ctgcactgct 


gactctgtag 


gacaggagag 


agaataaagc 


79560 




cacgttccaa 


ctgcctacaa 


tccaagtgtt 


ctttcagcta 


cccaccacgt 


gtccaccagc 


79620 


60 


tcccttcaga 


acccagctca 


ggttggaacc 


tgtcaattgg 


catagtcagt 


aggatcccaa 


79680 
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gatgagtgag cccttggcct caatgatccc 
tggtacctga tggtagctgt gctgctagga 
5 gacaggtctc ctgtgagcat ggagaaggca 

aggacaatac ctttgctggc atagttcaat 
acactcagtc cctgcaggat gcagcaaagg 

10 

agaggcccag acacacagct tggaatgaga 
cccaccatcc cagtcagaga ctcctgctca 
15 gcgagtttgc ccagtggtcc atcagaaaat 

ttgggcttgg ggacccccta ccatgacaga 
tatattgtgg gaattagata aacagtgtca 

20 

gatgcctcat ctttgggatg agggagccaa 
gaagttggct tccattatga ctcatccctc 
25 gttagcacag ggcaaggcga ccacacccta 
atatggaaag atgccaaaga aataccagaa 
ttagtgcatg taatttggga gatgggtgtg 

30 

gccaaatgat gaacacttta cctcccacat 
gagtgccttt ggttccctgg ccactgtcct 
35 tgactacttc catggcggcc ctcagggaag 
gtgccataaa aaaggggaaa gtttccctac 
atccccagta ggtgacctgc acacagatgt 

40 

gaaagaaaat tgataagcaa ctccatgaag 
cagagcagca attctggaaa atgcctaaga 
45 gagtgtccca cccagacact ccagctcaag 

cctttttttt ttttttcgag agagacggag 
tggcgtgatc ttggctcact gcaagctctg 

50 

agcctcccga gtagctggga ctacaggcat 
ttttagtaga gacggggttt caccgtgtta 
55 gatccgccca tctcagcctc ccaaagtgct 

gctgttcttg attatttaaa aaatttaata 
atttaaaata cattttctta ttgctttcag 

60 

tatttctggg atgtgaaatt ggttcaacat 
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aggtcagcca 


tgtggcccca 


gcatgtgttg 


79740 


tgggccttgg 


tggaaacatg 


ggtggtggtg 


79800 


ctgaagtacc 


tgtaagcaca 


gagcaatgag 


79860 


gggcgttttt 


gactgtacta 


cggtaagtgc 


79920 


taggggacct 


ccaggcacaa 


acagaaaccc 


79980 


attagagtct 


gccattagtg 


tgggcttggg 


80040 


gtctgatact 


gaggaagaag 


aacatcagtt 


80100 


agagcaggaa 


cagtcattgg 


ggccccaggg 


80160 


gcacatgtct 


tacagtgcct 


ataccccaat 


80220 


gcagtgcttg 


ggggaacccc 


tacctgcctc 


80280 


cagcatcgtc 


tgctctgcct 


ctgaaatgga 


80340 


cctccgtcag 


tgattgcagg 


tgagcaggtg 


80400 


attgaatggc 


tgatggtggc 


catttgaacg 


80460 


actgtgagta 


aatggcagtc 


gtatctagat 


80520 


tggcaggctg 


ttccatctga 


ctacccaagg 


80580 


gagggacttt 


gtgttgggct 


ctgtgcccct 


80640 


cactctatgt 


ggggtgctgc 


atacataagg 


80700 


cagagggctg 


taggtgagac 


cagagagtct 


80760 


tccagggggc 


tacctcttgg 


gacaaaatgc 


80820 


agattgattt 


gattttggct 


ggggttgcgt 


80880 


tactgttaac 


tttgtggagg 


caattgtccc 


80940 


gggagaagga 


tgatgcatcc 


tgcagggact 


81000 


gactactaga 


agatgggtgg 


aggtataaag 


81060 


tctcactctg 


tcacccaggc 


tggagtgcaa 


81120 


cctcccgggt 


tcacaccatt 


ctccttcctc 


81180 


ctgccactgc 


gcccagctaa 


ttttttgtat 


81240 


gccaggatgg 


tctcaatctc 


ctgacctcgt 


81300 


gggataacag 


gcgtgagcca 


ccacgcccag 


81360 


tctcaatacc 


aattatacta 


tttactgtac 


81420 


aagtttaatt 


ttattatttt 


tggtagggtt 


81480 


atacaaatca 


ataaatatga 


ttcaccacat 


81540 
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aaacagaatt aaaagcaaaa accatatgac 
taaaattcac catccttcat gttaaaaatc 

5 

actcaaaata ataagagcca tctatgacaa 
actggcaaaa gcaggcagca ttccccctga 
10 catcacttct attcagcata gtactggaag 

aaataaaagg catccaaata gaaagagagg 
aacgtgattc tatacttata aaatcctaaa 

15 

aacaacttca gtaaagtttc aagatacaaa 
tataccaata atgttcaagc tgaaagccaa 
20 caaaaagcat aacataccaa gagtagagct 

gaactacaaa acactgctga aagaaatcag 
atgctcatgg atagaaataa tcaatattgt 

25 

ttaaaaaatt gtatatatcg atttttatca 
taacaaagtc tttttttctt atttgatgtg 
30 ctattgtcaa attatatatg aataactgga 
ttaaaaatac atcttcctat tctatttgcc 
tatatgaggg agagtgattt ataatttctc 

35 

tgtgcctgta acaaattacc acaagctttg 
gttctgtggg tcaggataca acactggtct 
40 actactttcc tctctggagc ttctagggaa 

cttgaggctg tctcattcat tgactcatgg 
gtccttctca tgttgcacct ctgactattc 

45 

tgggaaaggt tctccactta taaggactct 
caggataaac tctgtgttaa gatcaactgc 
50 aaaatcattt gccagatatc atagcatact 

ctttgggtac tattattctt ggtaacatac 
tagaagaaaa ataacaactg tgtcctggcc 

55 

catgtggtga ggcaatgagt cataaccaac 
ggtgtgacat gagctgatga aaaagaactt 
60 acaaatttgg aagttttaaa atggtaccct 
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atctcaatag 


atgcagaaaa 


aggtttttga 


O "1 s~ f\ rs 

8160 0 


cctcaacaaa 


ttaggcatca 


aaggaacata 


O 1 DO 0 


acccacagcc 


aacagccagc 


atcatactga 


8172 0 


gaactggaaa 


aaggcatgga 


tgattattct 


81780 


ttctagccag 


agcagtcagg 


caagagaaag 


8184 0 


aaggaagtca 


aattgtctct 


cttcacagac 


81900 


gactccagca 


aaaggctcct 


agaaatcatt 


81960 


atcaatgtag 


aaccattact 


agcatttcta 


O O f\ O A 

82 02 0 


attaagaatg 


caatctcatt 


tacaatgcta 


82 080 


aaccaagtag 


gtgaaagatc 


tttacaataa 


82 14 0 


agacgacatg 


aacaaatgaa 


aaaacagttc 


82200 


taaaatggcc 


atacttctcc 


atgaatttct 


82260 


aaagctttgt 


ctgcatacct 


_ j_ i_.l_.__. _, _ 4_ 

ctgttgagat 


82 32 0 


tttctgcagt 


aagtgatttt 


cgaagatttt 


82380 


atgagttcaa 


tttggccatg 


accaaatgct 


8244 0 


acttttattt 


ttcagtttct 


tacatctata 


82500 


attatcacac 


tgcttatatt 


_ i_ 4_4_i__4_ _ 1_ 

agttttctat 


82 560 


tgacttaaaa 


caatacaatt 


ttatcttaga 


82 62 0 


cactgggcta 


aaaatcaagt 


tat ctacaga 


82 680 


aaaaatccat 


gaccttgcct 


tttctagctc 


82740 


gctttttcaa 


ggctggcagt 


ggcagatcat 


82800 


ttctatcctc 


aat tctctct 


tagacttacc 


82 860 


tattattaca 


ttagaattgc 


cataattctc 


82 92 0 


ttagcaaact 


tgattctatc 


tgtaactttt 


ri «~i r\ r» r\ 

82 980 


ctcagattcc 


agggagtagg 


atgtggataa 


83 04 0 


tgcctttttt 


gatgttgtta 


ttgcagaaaa 


83 100 


gacactggct 


gggagctcat 


tgtcagtaga 


83160 


gtgggtttct 


ttttgtccca 


tgaaagattt 


83220 


aattcaagta 


ataaatttat 


ttctattttc 


83280 


gtttcctggt 


tccttgagaa 


gttgtggggt 


83340 
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tgaaaggtgg gaaatcaaat gcataatcag catcagcatt acattaaaga gtacacgtgc 83400 

aggtttgtta catgggtaca ttacatgatg ctgaggcttg aggtcccaaa aatcccatca 83460 

cccaggcagt aagcatagta cccaacaggg gctcttcaac ctgttcccct ctccctccct 8352 0 

tccccatcga gtgagtggtc ctcaatgtct attgttccca tctttacatt catatgtatt 83580 

caatatttag ctcccacttg ctagtgaaaa caatgcagta tttggttttc tgttcttccg 83640 

ttcatttgct taggataatg gcctccagtt ccatccatgt tgctgccaag gacaagcttt 83700 

cattgttttt tatgatcgca tagtattcca tgttttatat gtaccacaat tcctttatcc 83760 

aatccaccat ggacaggccc ctaggttgat tccatgtctt tgctatcatg agtagcactg 83820 

caatgaacat atgtgtgcat gtgtcttttt gatagaatga attattttcc tttgggaaaa 83880 

cccagtaaca ggattgctga gtcaaatggt agttgtgttt ttaggtcttc taggaatcac 83940 

cacactgctt tccacagtgg ctgaaccagt ttccattccc accaacagtg tgtaagtgtt 84000 

ctcttttctc cacagccttg tcggcatctg ttgttctgtg actttttagt gatagccttt 84060 

atgactggtg taagatggta tcttattgtg gttttgattt gcatttctct gatgattcat 84120 

gatgttgaac attttttcat atttgttggt tgcttgtatg tcttcttttg aaaagtgtct 84180 

gtgcatgtcc ttttcccatt tttaattgga tttttttttt tttttttttt ggggtttggg 84240 

gtaaaaat 84248 

<210> 78 

<211> 38736 

<212> DNA 

<213> Homo Sapiens 



<220> 

< 2 2 1 > genomi c_DNA 

<222> (1) . . (38736) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP , or dTTP) 
<220> 

<221> initial_coding_region 

<222> (7631) . . (7729) 
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<220> 

<2 21> coding_region 
5 <222> (10476) . . (10682) 



<220> 

10 

< 2 2 1 > coding_region 
<222> (12601) . . (12687) 

15 



<400> 78 





nccattcatt 


ccaccatgtg 


aggacacagt 


20 


taccatcttg 


gaagcaagca 


acgtgaccct 




cgtggacttc 


tcagttccaa 


aactgtgaga 


25 


tttatcratat 

w v** *-* ^— 


tctgt tatag 


cagcacaaat 




ggagtactgc 


tgtatcgaac 


acctaaaaat 




tagaggctag 


tctctaacag 


acgattctgg 


30 


gaacctcagt 


cttagagatt 


attttagtgg 




cagtgagtgc 


cattctgatg 


tggtgtcaga 


35 




cagtggtt ca 


atgcttgtaa 




agattactgg 


aggtcaggag 


ttcaagacca 




accaaaaata 


caaaaattag 


cctgtcatgg 


40 


gaggctgagg 


ctgtagaatt 


gcttgaacct 




gcaccactac 


actccagcct 


gggtgacaaa 


45 


aaaaaaaaag 


tggatcaaag 


gccatccttg 




gtgtctatgt 


cccagggttt 


tgtggaaggc 




ggtgaaataa 


atatctaagt 


gaagtgttca 


50 


aagagagaaa 


tgaattaaaa 


gataaaatgt 




ttagaaaatt 


atcagcctgg 


ccatgtgaag 


55 


ggtatggcca 


agctactgtt 


taaataagaa 




attcactaga 


acagtggagg 


ggtgatccca 




ttcccattac 


aggtgccaag 


tgctaagttt 


60 


atacacacac 


acacacacac 


acacacacac 



ttgtcctatc 


tggaagatgc 


agcaacaaga 


60 


aagtagatgc 


tgaacctgtg 


gcaccttgat 


120 


aataattttc 


tattttttat 


aattacccag 


180 


ggactaagaa 


agaaactggt 


accaataagt 


240 


gtg'gaagtgg 


cattggaact 


gggtaatggg 


300 


taaggactaa 


gaagaagaga 


gctgtagaca 


360 


ttgtaattac 


aatatcaatt 


aaaatgtggg 


420 


tgaaaatgag 


aaataattaa 


aaactggagg 


480 


tcccattcac 


tttgggaggc 


cgaggtgggc 


540 


gtttggctga 


tatggtgaaa 


cctcgtctct 


600 


tggtgcacac 


ctataatccc 


agctactcgg 


660 


gggaggcgga 


agttgccatg 


agccaagatc 


720 


gtgagactcc 


atctcaaaaa 


aaaaaaaaaa 


780 


ttatcaagtg 


gcaaagaata 


tggctgaatt 


840 


agaacttaaa 


ggcaatgaac 


taggatgttt 


900 


gggaactgac 


tgcttataat 


aaaatacggg 


960 


ataatcaaaa 


agtaagtgaa 


atttaaagat 


1020 


ataaaaggct 


gttcaagaaa 


gaaaattaaa 


1080 


atatatatgt 


gtaggtagaa 


gccagatact 


1140 


gtgatatttg gaagatcttt 


ggggctccca 


1200 


ttattgacag 


atgaatggat 


aaactgtgag 


1260 


acactcatat 


aattcagact 


taaaaaagga 


1320 
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gatcttgcca tttgccacaa catggaagat 
ccagacacag agacatattg cacaatctca 
5 acaactaaca atatagaatt cagagagata 

agaagagaga gagagagaaa actggtaggg 
aagataagta ggatgagcaa atctagagtt 

10 

aacaaggtat tgtatttggc atttttgcta 
catagaaata caaacgagtg actatttgaa 
15 ttattatact ttaagttctg ggatacatgt 

cacgtgccat ggtggtttgt tgcacccatc 
aatgctatcc ctccccttgc cccccacccc 

20 

ccctgtgccc atatattctc gttgttcaat 
tggttttctg ttcctgtgtt agtttgctga 
25 ctgaaaagga catgaactca ttctttttta 
gccacatttt ctttatccag tctatcattg 
tattgtgaat agtgctgcga taaacatacg 

30 

ataatccttt cagtatatac ccagtaatgg 
taggtccttg aggaatcacc acactgtctt 
35 ccaacagtgt aaaacaattc ctatttctcc 

ctttttaatg atcactattc taattggcat 
atttctctaa ttaccagtga tgatgagctt 

40 

tcttcttatg agaattgtct gttcatatcc 
gatagaatat cgttcagtca cccaggctgg 
45 acctctgcct gctgggttca agtgattctc 

caggtgcata ccagcacacc tgactaattt 
cgtgttagcc aggctggttt tgatctcctg 

50 

aagtgctggg attacaggcg tgagccaccg 
gggttgtttg tttttttctt gtaagtttgt 
55 cctttgtcag atggatagat tgcaaaaatt 

ctgatgatag tttcttttgc tgtgtagaag 
attttggctc tttttttttt tttttttttt 

60 

ggctggagtg cagtggcgcg atctcggctc 
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actggagaaa 


cttatgctaa 


gtaaacttac 


1380 


tttatatgtg 


gaatctaaaa 


taacaacaaa 


1440 


gagagtgaaa 


ctgtggttac 


caggggtaga 


1500 


gagagcagca 


ggtcaaaaga 


tacaaagtga 


1560 


ctaatgtaca 


gtacaaagac 


gatagttaat 


1620 


aatgactaga 


ttttagctgc 


tcttgccaca 


1680 


ataatagata 


tgttaatttt 


taaattttta 


1740 


gcagaacgtg 


caggttgtta 


cacaggtata 


1800 


aacccatcgt 


ctacattagg 


tatttctcct 


1860 


cagacaggcc 


ttggtgtgtg 


atgttcccct 


1920 


tcccacttat 


gagtgagaat 


atgcagtgtt 


1980 


gaatgatggt 


ttccagcttt 


atccatgtcc 


2040 


tggctgcata 


gtattccata 


gtatatatgt 


2100 


atgggcattt 


gggttggttg 


aagtctttgc 


2160 


tgtgtatgtg 


tctatatagc 


agaatgattt 


2220 


gattgctggg 


tcaaatggta 


tttcaagttc 


2280 


ccacaatggt 


tgaattaatt 


tacactccca 


2340 


acatcctctc 


cagcacctgt 


tgtttcctga 


2400 


gagatggtat 


ctattgtggt 


tttgatttgc 


2460 


tttttcctat 


gtttgttagc 


catataaatt 


2520 


ttcacccact 


tttttttttt 


ttttttttga 


2580 


actgctgtgg 


cacaatctca 


gctcactgct 


2640 


ctgcctcagc 


ctcctgagta 


gctgagacta 


2700 


gttgtatttt 


tagtagagat 


ggggtttcac 


2760 


acctcatgat 


ccatccgtct 


tggcctccta 


2820 


cgcccggcct 


ccttcaccca 


ctttttgatg 


2880 


ttaagttcct 


tgtagattct 


ggatattagc 


2940 


ttctcccatt 


ctgtaggtcg 


cctgttcact 


3000 


ctcttttgtt 


taattagatc 


ccatttgtta 


3060 


tttttaggcg 


gagtctcact 


cttttgccca 


3120 


actgcaagct 


ccgcctcccg 


ggttcacacc 


3180 
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attctcctgc ctcagcctcc cgagtagctg ggatacaggc acccaccacc acgcccagct 324 0 

aatttctttt gtatttttag tagagacggg gtttcactgt gttagccagg atggtctcga 3300 

5 

tctgacctcg tgatccgccc acctcggcct cccaaagtgc tgggattaca ggcataagcc 3360 

accgcgcctg gccaattttg gcttttgttg caattgcttt tggttttatt catgaagtct 3420 

10 ttgcctatgc ctatgtcctg aatggtattg cctaggtttt cttctatggt ttttatggtt 3480 

ttaggtctta tgtttgaatc tttaatccat cttgagttaa tttttgaata agctgtaagg 3540 

aagtggccca gttttagttt tctgcatatg gctagccagt tttcccaacg tcatttatta 3600 

15 

aatatggaac actttcccga ttgcttgtct ttgtcaggtt tgtcaaagat cagatggctg 3660 

taaatgtgtg gtgttatttc tgaggactct gttctgttcc attggtctat atatcttttt 372 0 

20 tggtaccagt accatgctgt tttggttact gtagccttgt aatatagttt gaagtcagta 3780 

gtaaccattt tattctctat atgtatctca taatatcatg ttgtatactt taaatatgta 3840 

caatgaaatt tattttacaa aatgtaagtg cttgatatcc tgatacttaa tgtctaaata 3900 

25 

cttaagacat gtcccctaag aataatattc tactatatag ctttaataca attattacat 3960 

cccaaaattt attaatataa taataatcca atatatagtt cttattacaa tttcctccat 4020 

30 tattccctaa ttataaaaat atgaggataa tggaggggct tatgtgttta tatataaata 4 080 

aacacataaa ttatggttca ctcatacatt tggttctctt gtcctttgag tctttcataa 4140 

tctactcata gagcagtttc tcaccatttt ttgtttgttt gattttgcat ttattttagt 4200 

35 

cttttatgac actgactttt gtttttgttt ttttagtctg gagcatttgt ttcatataat 4260 

gccccatatc ttggacattt cttcttgttt tccatgagtt caagttaata attttaataa 4320 

40 aaacacaaca tagatgatgt gacttaccaa cttttcattg catcacataa ggatgaacat 4380 

aataacagtt tatttcaatt ttggtggtgg ttactttaat caattgattc gggtggtgtt 444 0 

atcagttgaa tgtttatgtc ttcccgaaat tcatatgtta aaattctaac ccctaaagtg 4500 

45 

atggtattaa aaagtggggc ctttgcaagg cgataaggta acaaaggtgg agtcctcatg 4560 

aatgggattg gtgctcttat aaaagagacc ccacagagct ctcttgctcc tttcgaccat 4 62 0 

50 gagaggacac agcaagaaga cagacatcct tgaaccagaa agttggcctt catcacactt 4 68 0 

gaggatatgc cagcaatttg acgtcagatg tcctggcctc caaaatcgtg agaaataaat 4740 

gtttgttttt tcagccacct agattatggt atttttgtta taacagtctg aacagacaaa 4800 

55 

gacaggtggt attcacatgc tattttcatt ttaaaggtat atttttaact tcataactaa 4860 

aaaggaatca gtagtcttat gcttttaaga caaagtgaat attatgtttc aaaaaatatt 4 92 0 

60 ttcactgaat ggtgttagta tccattaaat ttgtcatctc atcctctatt ctcagcctaa 4980 
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atgtattagt ctactttatt actcaagtta 
gtataaaagt tttgcatttc ctgggaacac 
5 tcctaattat gtgtcaatgc tcattattca 

ctctttggac ccaaaacaac tatctgagtt 
tcccatctgc atctcagctt tcatttccca 

10 

aatcttctat tccctaagga catctgtcct 
tggcatggtt taagtcaatg tttccccatg 
15 gattcagaga accatcaacc ttgtcaaaac 

aatcataatt tggacagaga tatttcaaga 
taccaaataa tatatgtaag acacaaagag 

20 

taacaaataa gattttatct tgttaaaaca 
agatgactct ttctagctgt tgattcctcc 
25 gcttaattgt cactttgagg aaaatgtttc 
tccttcatta taggaagaaa cttctcactt 
aatagcttat gctgaacatt agcccttcct 

30 

accaggctgt gttttcaact gaggacttcc 
attacaagaa agattaaaag tcttgcaaac 
35 tttaagactt tcttagtact tttcttaaga 

gagttagagg aatatagtaa tggaggaagg 
ttttgttttc cagcaatcca aagaacaata 

40 

agaaagatat atcaaaagag aagagttcct 
tggaggggga aacacgataa ataagagaag 
45 cctggtcctg aaggagaggg ttggagagtg 

tgggaaacag ggctagctga gaaacaaaaa 
aaggccttct tattctctaa ggagggatcc 

50 

tgcccatgga cctgcaggaa caagagatat 
atattgttag agaaacacca aaatttatct 
55 tgtttctgag ctctgccacg ggaacttggg 

agacgcatca aaatgagtag aaaaagtcgt 
ctgatacagt gagactcgat ctggagaaaa 

60 

gcaacaggag tgggccatga gtccagcatt 
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caactgaaag 


acggaaactc 


ggatatcaag 


. 5040 


ataattgtag 


taaactctta 


gataaattga 


5100 


tgaggaagtt 


agagaataag 


aaggccttaa 


5160 


tttgtttctc 


agctagtcct 


gtttcccagg 


5220 


ctctccaact 


ctctccttca 


cgaccagggg 


5280 


tctcttattt 


gatcttatgt 


cagtttgaca 


5340 


tagctgaggt 


gctaagtgca 


aatattaact 


5400 


cataaaactg 


ctctttatta 


gttgacccaa 


5460 


cattttcaga 


tataaaccat 


aaaacaaata 


5520 


aaagcaattg 


atccgaggct 


actatgtata 


5580 


cccgcatgcc 


cgcataaatc 


agtcagaacc 


5640 


tcctcacaaa 


gagcccacac 


gcttcatttt 


5700 


ttcctttctt 


tccttcacac 


ttccctcttt 


5760 


acatatgaga 


aatcacagtc 


acttttgtga 


5820 


tctctggatc 


ccttgaatgc 


tggaatagtt 


5880 


tgttgcccaa 


tactaatgta 


tccaatgcag 


5940 


acatgaattt 


ttttctttct 


atcccaagca 


6000 


taatatatgc 


aaagggagag 


ataagtggag 


6060 


aaaaggagat 


tttgcttact 


agtttctagc 


6120 


tctcactaaa 


aagtttatga 


tacatgcttt 


6180 


agagtacagg 


gtatgagagg 


tatcatggaa 


6240 


gacacatgtc 


cttagggaat 


agaagattcc 


6300 


ggagatgaaa 


gctgagatgc 


agatgggacc 


6360 


ctcggatagt 


tgttttgggt 


ccaaagagtt 


6420 


ctagaactag 


aaatgccagg 


gagacagttt ' 


6480 


tttatggaga 


atgtgcacag 


agagttgtat 


6540 


ccaactcctt 


tgttgtcaaa 


cataaatatc 


6600 


agcacttcag 


gtgagtgcag 


agccaggaaa 


6660 


ggcattttac 


tcgtcctggc 


ctctttcacc 


6720 


ccctgaagtc 


ccagcttctc 


ccttagggaa 


6780 


ctgggtttcc 


agcggggttg 


cccaaggcac 


6840 
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tggtttctgt 


cttacatgac 


teagagtgea 


5 


gaacaaaggc 


aagcaccctg 


tttcagcact 


agggggagaa 


agaggttaca 


ggctccagac 




attacacaca 


caagcccata 


aaaagcttga 


10 


caaiiaaaaciii 


c ttccaatcrt 


atgaagecac 




ttttcaaata 


ttaaaagcac 


tgcaggataa 


15 


aaacatatct 


aagatgcatg 


ctaatagaca 




aaaaattcat 


caaatatatg 


gcaatacagt 




tgctgcatat 


tgcaaatttt 


gtgtgaccct 


20 


teat get get 


gccaccctgt 


gatgtgttta 




ggaaaaatcc 


tcatgatacc 


aaggttgtgt 


25 


ttctctaaca 


cacaggccaa 


gtctggacta 




gecagaaett 


gggagctctg 


tacttttaag 




caacaccatt 


atgacatcac 


aacctatttc 


30 


tgtcatcaac 


ttctcccaag 


cagagaaacc 




gaagaaacgt 


ctacaggcaa 


aagtcaaagt 


35 


ggagag^ggt 


taggggaagt 


gggaagagat 


gagtgtggag 


acccttaagt 


tttgctggag 




actgtcccag 


aacttttcca 


caggagattg 


40 


t atgaggagt 


tgattttctg 


ttgttgtctt 




ttttctttta 


cgaaagtaat 


ggtggtaaat 


45 


tttcacaact 


geatgettaa 


caatccacac 




gcaaattaca 


gaaaaaatat 


acctcctctc 




aaatgatatt 


taccttttat 


ttgtctgttt 


50 


tgtattgeat 


aaacraaaQQa 


craaaaacraaQ 




gagttttggt 


ttctagaatt 


gcctctgcac 


55 


gactggattt 


caggacctgg 


tgggactggt 




tctatgaggc 


tcaggtgaca 


aaaggtctag 




tgaatctctg 


aggtggatgt 


caggaaggaa 


60 


acatgagaac 


atgeagtatt 


tggttttgtt 



gagtttggat 


gccatgttgg 


gggcctctga 


6900 


agggagattg 


tagtactgaa 


ggcggacacc 


6960 


aagaaacaag 


caaacctctt 


taactgggaa 


7020 


agggccccta 


gaatctctag 


etatgetaac 


7080 


tctgtaatga 


ctggaggagg 


ttgggtgtat 


7140 


aaactaagag 


catgtctagt 


aaaatatgat 


7200 


aaaaataaat 


gattagagtg 


catttgtcta 


7260 


tgttgtcagt 


aggactgggc 


aaagggacat 


7320 


ttgcctctgg 


atattgggac 


ctctcaagat 


7380 


ttatctataa 


tggactgtgt 


caaacattat 


7440 


ggctttcact 


gctccataat 


atcagtttct 


7500 


attggcagaa 


actgacttac 


cagttttgea 


7560 


agctaaatct 


attttttctt 


ccgtagttgg 


7620 


caatgagacc 


ateataatge 


tcccatcaaa 


7680 


cgaacccacc 


aaccaggggc 


aggatagect 


7740 


tattggggta 


aatctaattc 


agaacatgtt 


7800 


gatgtatgtt 


ttgggactgg 


tcatccttgt 


7860 


ggactttggg 


gtagaageca 


cttggagaaa 


7920 


tcttagaaac 


gttttttccc 


ttattctgaa 


7980 


tgtatttttg 


caatacagta 


cttctctata 


8040 


aatgacatat 


tgtatataca 


tttttctgta 


8100 


tgattctttg 


caaggaattt 


ttcccttact 


8160 


cctgttttct 


tcttaccttt 


ttgagaatgg 


8220 


ctcctctcat 


ccagagttta 


ttaaagaagc 


8280 


ggatctatct 


tggattggga 


gggaaacaca 


8340 


ttatgtgatc 


tgaggtgagg 


cactctgttt 


8400 


ttccttgcct 


catgtacaat 


catatgggtt 


8460 


gaaacttctt 


ggtaaatgat 


aatagagctg 


8520 


aactacctcc 


taagacagag 


cttcagttat 


8580 


agtttgctaa 


gggtggtggc 


ctctaaatga 


8640 
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tgagaacaca tggacacgta gaagggaaca 
ggtgggagga gggagaggat caggaaaaat 
5 tctggctaca aaaatctccc aaagccagca 

ccagttcttt ctttccctcg ttggtattta 
tccaacacta gatctgtgct gtgcaatgta 

10 

tctatgatga tggaaatcca caacagctgc 
atcgatgatg tgaaatatgg atagtgcaac 
15 aaatataagt agcccatgtg gctactggca 
tgttttcaaa cttttagtag gaaatatatg 
tgaacatgca ggaagctcaa ggatgcagga 

20 

gagacatcga ggctttagtg agttctactc 
attgaagaag agtgccatct caagacaaat 
25 gcatcaagac tactactggc tactaggagg 
gggaaattgg ggtcccagag catcaactgg 
ggattaaata acactctact gtatggctgt 

30 

acaatagctt tcagaatgct gtgagctcag 
aagtcacttc tgaggtcagt ttatttattt 
35 cagagtggga aataaaacca aggaactgag 

tattgtgacc tcaattttct gacatccagt 
tcccacacat tacaccatcc ttgggcccgc 

40 

ctgcgctcaa tgtggccagt catatgcctg 
aaccaccccc tcactcatct ttgagaggtg 
45 agtgagctat taggttggtg caaaggtaat 

acccaaaatt acttttgcac catgctaata 
ttgcacgagg ggtttctgag ttggccgaga 

50 

tgcttgtggt cacactcaca gtaattttcc 
ccagggaaag aatttaggag agtaacattc 
55 ggaaccagtt catatgttca ttcattcagc 

ggctggacac tggagaggcc tacaacaaga 
cagataaggg aataacttgc aatgtaatgg 

60 

ttgggaatga ttcttaccca gcatgtctct 



PCT/US01/48437 



atgcacactg 


gggcctttca 


gagggtggag 


8700 


aactaatgca 


cagaaactca 


atttggaagc 


8760 


aggaagtcag 


agagagtgcc 


cagaaaaatg 


8820 


gagattattg 


tttggaaccc 


tcagttacaa 


8880 


actgtctatg 


atggaaatga 


aatgtaactt 


8940 


actgttacaa 


tatggcggtc 


acatgtggct 


9000 


tgaattttaa 


attttactta 


attttacttc 


9060 


accatactga 


gtagcctagt 


acaagagtaa 


9120 


ggcctcttac 


tccagagaac 


aatcccacag 


9180 


tgtggtctcg 


gggtgggctc 


agactcacca 


9240 


ttgtcacaca 


ttgtaccagc 


cctttttgag 


9300 


ctagcttatt 


ctaagaaata 


tagactctag 


9360 


cccaggagtg 


tacacttcag 


tcattatgtt 


9420 


tgagttccta 


aaagaagaag 


tgcccatatg 


9480 


gggatgtgcc 


ctggagccta 


cctacctgta 


9540 


ttctcttttc 


tgtgtgagtt 


ggagtaaagg 


9600 


cctccacttc 


ataaatgtag 


aaatgagaac 


9660 


tacccaaggg 


aacaagctgg 


gtcaagtcta 


9720 


ccaggccttt 


attctgtacc 


tcccgctgcc 


9780 


acctagacct 


ctgcccacag 


ccaccccatg 


9840 


cccaggttcc 


agggcctgcc 


tgagactcag 


9900 


tgcttttgag 


agaggcaata 


ggcacgcaaa 


9960 


tgcggttttg 


ccattgaaag 


tagtggcaaa 


10020 


caccatccag 


attttctgtg 


tggttgatgc 


10080 


gagaaactgc 


atctggtaca 


ttgccacctt 


10140 


aggcaagcct 


cgtgggccct 


gctatctgtc 


10200 


tctaccacac 


tactgaggtc 


ctgtttgagg 


10260 


caagttttac 


gttgcagatt 


ctgagttttt 


10320 


aatgatccct 


ccgggacttc 


ctgagggaga 


10380 


caaaaaggga 


taagagagat 


tcgtggccct 


10440 


ttcaggtgca 


tagcagcctg 


gctggaagca 


10500 
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ttctgagtgc tctgtctgcc ctggtgggtt 
taaatcctgc ctcattgcag tgtaagttgg 

5 

tttcttatga ttatcattca ccttacacca 
ctgtaagtat tttttagatg ggatgttcta 
10 cttattattc catcctttga aaatctctct 

gccaaggtta tgtaaatcaa aggggggctg 
acagagtgga attgaaatgt cagataaggt 

15 

tagagtttct taattactca actaatctaa 
actgagaatg ttggtacata gagtggtgaa 
20 gggtgatcag aaaatgctga caaggaaatg 

ggtctaaaaa gacatcagac taaagatgaa 
tggtaagggt ttctcctaat gattacaata 

25 

cattattatt gctttttgtt atagctgaaa 
aaatataggg agatggagag tgtaactgag 
30 aagaagaaaa acaatgaagt ttggctaaca 

ggggtatgtg catggccagc aacgcattga 
gttgcagaat atttattgtc ctgcaatcat 

35 

acgatggaca gatagaaagg tctggtcaaa 
gagccttgct tccaaggtaa tggtataatg 
40 gtaattggta caaattccac cttgggaaaa 

tagtgataaa gtataaattc atgatgggac 
ttgagcttgc ttgctggaac aatttctagg 

45 

ctgtgtgtaa ataccacaaa acaaaagttc 
gatgaccagt tctgcacaaa gaatgtctca 
50 gttggtgttg gtgattagca acagtttact 

caccaaaaac agcaaaaata aaaaataaac 
agatacatag acagggatgt caatactgaa 

55 

ccttgatcct cacaaatctc tgagaaataa 
aacaggagct agagaaagaa agagacttgt 
60 atgagactat aatggagctg ctgattctca 
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tcattctcct 


gtctgtcaac 


ccggctgcat 


10560 


acgaaaagga 


tataccaacc 


agacttcttc 


10620 


tggactgcca 


tagagccaaa 


gccagtctgg 


10680 


attttatgag 


gtttctgaaa 


gccttgattt 


10740 


gttaattctg 


gtttgggcat 


ctggaggaag 


10800 


catggatgag 


attaagggga 


ttacatacta 


10860 


tgatgatgaa 


aaactaaaac 


ttacggtttg 


10920 


ttttggaggc 


ataaattatg 


agtttgtcaa 


10980 


ttcagattgg 


caaacagagc 


agaaagaccc 


11040 


tggtatgtgt 


gtgtggtagg 


gggtcagttt 


11100 


gaaacagaga 


ataggaaata 


tggggttaga 


11160 


atgaataata 


ctaatcaggt 


gaaacggtaa 


11220 


gaagtgtgat 


atcttaaatc 


aggagataag 


11280 


aattatcgca 


ggagaattct 


ctctctttgg 


11340 


gaaataatgt 


gtctgaagtt 


gaaaaattaa 


11400 


gaggaaaaca 


ggcaacataa 


aagtaaagaa 


11460 


aaaaatgagg 


tcagccaaga 


actcacctgg 


11520 


ttcatgcttg 


tacagcaccc 


attataaact 


11580 


aggaaaaagt 


gcatgcaaat 


gaggctgaca 


11640 


attctggaga 


gaaatttggg 


agaaatttgg 


11700 


gattttatat 


tggaaataag 


agagattctg 


11760 


taatggacac 


aacatgtatc 


ttctatttaa 


11820 


atgtgatcaa 


attacctccc 


tttattatca 


11880 


ggacagaaat 


ggtagaactc 


atccttgtga 


11940 


a'ccttcactg 


tttttttcca 


agtcttaaaa 


12000 


aacaaaaata 


aaaaacatcc 


aaacaggttc 


12060 


ttttacaaag 


ggcttacaca 


gccaaaacct 


12120 


atgttagtat 


gcatgtttga 


tacatgagga 


12180 


ccaggcttac 


acagctactc 


agcagtaaag 


12240 


gtcctgggag 


ctattacgcc 


actctgctct 


12300 
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cttactgagt gaacaatgtc atcagaacta cgtacgtctc aaagaagtga gaaacaagtg 12360 

tatgaaatgg attagttgat ttagttacta ctcatcttcc cttctcttct caagaaactg 12420 

aatgagtgaa ggagttgaat taagacaatt gtgaggagga gcagttttgt tttcctgatt 12480 

gtttccttac ttcattctct aatggttgag gtggtcttca tctctatggt cacctctaaa 12540 

ctaggcatga aagcttcggc tgacccaagc atcactgata ttttatttct tgactttcag 12600 

ggaactctgt ctctgatgct ggtttctact gtgttggagt tctgcctagc tgtgctcact 12660 

gctgtgctgc agtggaaaca gactgtctga cttccctggg tgagtgtgct ggccagcctc 12720 

15 gcttaatctt gcctagtgta tcttatctct gcactgtgtt gagtatgtca ccaagagtgg 12780 

tagaaggaac aatcagtcag tcatgagata cacatgggag gtcatttaaa tggtgatgga 12840 

agaccgagaa gaaaagcaga tggtaattga gtagctgaca agctgagagt tcactggata 12 900 

20 

tgaaaatagt taaaatgaga aatcaactgg ttcaatcttc ccattttgtc attgaaggga 12960 

atgagactct gggaagttaa atgactgggc tggcattatg atatgagtct gtacctgtgc 13020 

25 tgaggacgct agaacctggc ttgcctccct tctaagcaga aacaatttct gccacaacca 13080 

ctagtctctt taatagtatt gacttggtaa agggccttta cacatgtgac tggatccagt 13140 

gaatgtctta tgctctgctt ttcctcctgg tgattttaaa attctttttc cttttaaaag 13200 

30 

atatattaaa aatgtattgt tgaatcaaat tccaagggat accaccccta tggacttatt 13260 

gctttattta actgaattta aaagccttga tttatccaga attttattat agagaataga 13320 

35 agaatactag actgataaat tgtaattatt tgaagaaccc agtgatctgc tctacttgtc 13380 

ttggattagc taagaattct atgtatacag ttggaaaaaa tggcatatcc acatgtatct 13440 

tgaacctgag taaagtctga agacctaaca tactttgttt cttctaaagt gtgcttttcc 13500 

40 

tgcctcacag ttactcatgt cctcaaagac gactcatgat gctggatatg aagaactatt 13560 

gacttcttgg gaaaaaacgg agaaatatta attggaaagt tgatttttat gataatatgg 13620 

45 aaaacctaac cattataaaa aagcaaactt gagtttccta aatgtaagca tttaaagtaa 13680 

atgcatattt gttttaaaaa attatttcgc tgtcatttat gatatgtgtt cattggggat 13740 

ctcttggttt gcctgatact gacttcagca aaagcacagg gctgtaaatt accatttact 13800 

50 

agactagcca aatagtctta gacatttccc agaaaactag gcagaatgat gattcaatgg 13860 

atcacagtga ggcaaaggat acagcttttt cttatactcc tcaattgtga acaggagtta 13920 

55 actgatttgc tgtggtgcgc agactctttt atcacaggtg ctactgattt atcccatgta 13980 

ttttactcat tagtgcataa aggcaaaccc cataatgaag tctccgagtg tatgaaagta 14040 

gccggcttca aaataaaatc tttgagtgca tactttattt taatacttat tttgttattt 14100 

tttcctttaa acacagaata aatcaacaat gacagacata aacccaaatt ttaactcaat 14160 



60 
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tttctcctca ccaatattca aacaaaacta acgagtgaac aaatgctaac caggaatgct 1422 0 

tatctgttat gaagagacaa ccagttcata cttacgtcct tatttcaaaa taaaaaagac 14280 

5 

gaattgtttg agtataacaa acataagcct caccttgctc atgcacacac acgcacacac 14340 

acacacacac gtgtaaaaaa gtatttcaat tttatatgac ctccatttct ttgaaagatt 14400 

10 ttgtcatatt tccttgctct gaccctctgt agcaggggtc atacttcata ctcactgtct 14460 

cagatatcct gatactcaga gagtttctct gctctttttt tccagtctca aatggaactg 14520 

tcagtttctt tgattttagc tttatcttca aatctgtttc attaggtctt ctccctgtcc 14580 

15 

tttgcacccc ccactgacac acaccatttc ttttatttat ttgcctattg tggacattca 14640 

acttttattg atacttattg gataccagac actatgcttt taatgccagg gctaccaaag 14700 

20 tgaataagat acatgatttg gcttcaagat atttatagtc aaataaggaa aacagagatg 14760 

acaatgaagt tagggatttc gctaactgtt atttgagcac agaataaaga cactaactgt 14820 

tatttgagca cagaataaag aatatcatct tctgcctaaa agaatgtaga ggataaacaa 14880 

25 

gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgagaga gagagagaga gagagagggg 14 940 

gggggagaga aagacagaaa agacagaatg tacttccttt gaaaatccac ctccccacta 15000 

30 ttaatcacat ggtgccacct tccgccacag gagaggacca ttgaagcatt gtcatttgtc 15060 

tggggtaata tcagaggttc attgtctcat gccaagaaaa ttgaagacat ggacacacaa 1512 0 

agagtgaatt taagagcaga agtttaatgg ctaaaagaaa gaagagtgct tcctcatgca 15180 

35 

gaggaaggga tcccgaatgg atttctgggt ttggggcaag atgaggctgg ttttatagat 15240 

gagtctgagg aggtcgtgtc tgatttacat caggcatagt agattggttg gaccagatgt 15300 

40 gccatttaca tagcatggga agtgctggcc atcccacact aatcttttat tatgcagatg 15360 

gggtctccac ctggccggca ccatattgcc tgcttttttg ctgcacacat ggtgacaaag 15420 

aaaagggaag agggaacctc catgcttaat atatctagct tccaggtatc ccttttctat 15480 

45 

tggcacaact gctggcattc acctttgcaa gcttccagct tgcttatcta ttcttgcagc 15540 

tggatttttc aggcttcttt tgctagacaa gaaatgattt gggggctgct ttttaataaa 15600 

50 aagaaaatct tactgagaac tcccttaccc tcactagcta cttaaataat ttctttttag 15660 

ctcctgtgtt atttctcccc tcagaagtag taaccctaac tgctgttaga gggtgttgga 15720 

cagtgactct ttctgactac ttcctgctga aaaggggcat ctagtgggga cagcagctag 15780 

55 

ggctcctcct gagttcaatc taacggtcct cagaagaaag gcgtgtccgt gtgtggttct 15840 

gcctgcagca ccatttaaag tttgcttcta ggccaaaaga gataaatttt acaagaaggt 15900 

60 ttagaataca gggtttgaat atgagtatta agattaccat tattagtggg ggtaccatag 15960 
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gccataacca tgccagtgga gtttgatacc 
gtttgcctcc accaggtgtc gcttacttta 
5 ttttgcataa gtgtccccca attattaatt 
ttacagcaat tagaattttc caaccggaat 
gtgttgcaaa tagtagagcg catgtatcaa 

10 

acgtctaaaa cttttactcg ccaagatgta 
aaatgtaatc ccatgtataa ttaaaatctc 
15 atatctggct gcgaatctcg agaggaaatg 

aaggtgggac tgagtaagat gggtagccct 
gcttaagatc tcctattttt tgtcattgat 

20 

ttgctctctc agctttccag gctttgactg 
taacttttac tgctgagggg gttgaaagaa 
25 ggaagagaga gagaagggag agctttcacc 

gatcctattt cttggagttg ggcttttgct 
agagaggtta catgctcaac caactcagag 

30 

aggaaagacc atccatacag cattttaaaa 
gaggtattta agtagaccac agcggggagt 
35 gatgcctttt ctgtatggca tgaaaatgct 
accaggagat attggatgcc cttcatcttt 
tcccctggat agcttcctat tctttgggtt 

40 

ttatttttaa gacagatttc acaagcatta 
tctcctgaaa acaatctctg ggcacattga 
45 tgatggagga ttttaaggac tttccattgg 
tgactgtagc catcctgaga gctggaaaat 
tgcagggaag tactgaggtt taatttcttt 

50 

agtccataga tgtcttgagg cttccttgct 
tttccttcag ctacttcatc ttttcccttt 
55 tttcatggaa ggaaaactga ggatgataac 

tgttttgttt tttttgtttg tttttttgtt 
tcactctgtg gcccaggctg gttgcagtgg 

60 

cctgggttca cgccattctc ctgcctcagc 



tgttagccat tttaatggct tgtaatacta 16020 

ctagacgcgt taatataaaa ggaacctttt 16080 

ctactataaa gattataatt agacagaata 16140 

tccatattgc aggtgccaca atgtatagtt 16200 
ttcccggaaa ggtggtgtag tagataattt ' 16260 

gaatttctcc ttggggttct atgaagttac 16320 

cctgcaaata tgtgttaaaa agaagtgcta 16380 

ttgaaatgac aaaaagtatc tggtgagtta 16440 

tacttactta cttatctttt atgattttca 16500 

gtccaggacg ttcttctggg ctgtcagagg 16560 

gagtgtgatg tatgcaggaa ttaatacctg 16620 

cagtgtaggg cccttcccag cttggggtca 16680 

aataccaaat ctcctgggtt aaataaaggt 16740 

aactgtgtta attcctgctg gaagtgagcc 16800 

atttcttaat ttaatagaaa atcattggta 16860 

gggctcactt tgtctgccat tgtcactttg 16920 

tatttcatta actaacactt tattacctca 16980 

tctaaccagt tggcgaaagt atctgtccct 17040 

ggcatgtggg tgaagtctgt ctgccagtcc 17100 

tgaggaggaa ggagccacct gttcagggga 17160 

acaacctgtt tggctgtttt tggtaagttc 17220 

tgagttttat cctttcccaa gtgaaaagct 17280 

ctagagactg gcaagtagag tttgccatcc 17340 

atacacttga gaagtggccc attctatttc 17400 

atggaacctt cccagattag aggggcttga 17460 

gctgacttag ctgcctgatc agctaaccta 17520 

tgatgtccct tacagtgtag cactgttatc 17580 

ctgctaattt cctggtggtt tttgtttgtt 17640 

ttttgttttt tgtttttttg agacagagtc 17700 

cacaatctcg gctcactgca agctccacct 17760 

ctcccgagta gctgggacta caggtgtgcg 17820 
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ccaccacacc aggctaattt tttgtatttt tagtagagat ggggtttcac cgtgttagcc 17880 

aggatgttct cgatctcctg atcttgtgat ccacccgcct cggcctccca aagtgctggg 17940 

5 

attacaggct tgagccaccg cgccctgccc ctggtggtat tttataggag atccattagc 18 00 0 

agtaagaaag tgtctttcct tccaaatggc accatgagca tggagaacta agaaagtata 18060 

10 cttggagtgt aaatattacc tacctttatt ttgcttaatt caagtgctct tgtaagagct 18120 

atcagttcag ctaattgact gcttgtggct ggagagagag acacactttg aaagacgtca 18180 

tttagagtgg ctactgcata tcctgactgt ggagcttcac agctactatc tgtgaagagg 1824 0 

15 

gtctagtctg gattttctag gagagtttcc ctgagatact ctctggctgc attggtttgt 18300 

attataactt gttcacagtc atattcaggt tccccagctt cctctggaag gaaagtggct 18360 

20 gtgtttaggt gagaacaagt gtttaactca attgtggaaa tctctaacaa cagagtctgt 18420 

taaggagctg gctggctgtt agccaaaggc tccccctaga ggacagcaat cctgccacat 18480 

tatgtggggt gtaaatcatt tcccagggtt aatttggtgg cttctgggac cagtagggcc 18540 

25 

actgtggcaa tggcttggag gcatgtataa ataataggtc cccccaactg caactaaggg 18600 

gttgagaaaa atattagatt agaatttttc ctgagatgcc cctcacgttc atgctatggg 18660 

30 aagaggggag gcctggatta gacaggagaa gagactggct ctgttgtcca gaaggaggtc 18720 

taccttcctt cctttaattt ccagaatcac ctggggcttc tgtgctgttc atgacagttt 18780 

gagccactgg agctggaggt ttgagcccca ggaaccatca gtcctattgg accatctgtg 18840 

agactggttc tgaaactggt gacttctgtc tctgggagca gtctgatctc cagtgatcca 18900 

tgctacagtc tggacagggt caaggtgact tcttgctgcc tggcactcct tcttaaagtg 18960 

40 ccctgacttg ccacatcaat agcatctagc ggatgcacct gaggaatcct gggctttgca 19020 

agcctgcaaa acagctactg gaggctctgt ccttctcttg tgtttcctct ctttctcttg 19080 

ggcatcctcc tgtccctatt taaaagacca aagtggccat gctcaggagg ttctccaagg 1914 0 

45 

tgctatctgg tcctatcacc tgctgctgta gtttcctctt aatgttggga actgcctgtg 19200 

taataaactt gttctttagg atgagccatc ccttgactga ataaggggat aggaagtgtg 19260 

50 ttttattagt gtatctctca gcctttccat aaaggctgca ggattctcat ctggcttttg 19320 

gtctatcatg gacagtttag aataattaaa aggtttggcc ctagttcttt gtaggccctc 19380 

taatatacac attaaaagtg tttcctttac catttatctg cagaactatt ggagttccaa 19440 

55 

tcagggttgg caaaaagaac tgcttccctt cctatttgga atggtgtttc cattattcct 19500 

tcaccttctc tgtctccttt tctccttttc agtctactat aaaaaacatg ttgttcatct 19560 

60 ccaaatttct ctgctagctg cagagctgcc tgcttttcag ctgcggtgag ggtttggctt 19620 
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agaagcagca taacatccct ccatgcgagg 
tctatatacc tattggggtc atcagaaaat 
5 cctgaaatga gcaggaaatg tgaatcctaa 

ggggtaagag tgaagcaggg aaagtgggta 
gtggcatggt tagtgggggt cccagataag 

10 

ctgtctcaga ttgtccccct ggaatatgct 
ttcctgacat ggctgctaaa agagctaggt 
15 tcttgcaggg caaagaaagc ctgtacatag 

tagaaaagat ctaattgttg gataatatta 
ctgttcatcc ccaagatggt aagaagttgt 

20 

aagtctcagg gtcagaggag tcccaatgtt 
aaagacaatc tgttatgcat ctagaaagag 
25 ttagtctcat tccttttggt atgacacagg 
ctactgtttt ctctccttgc ttcctgtggt 
gttgcaggca taacccccca ccccagccat 

30 

tcatgcatac ccatgcagcc ctagttctcc 
tgtatgatct gcatggctcc ccaaaagatg 
35 tttgggcaag actgctttaa tgggaggggt 
atggcccatg ctaaagtgtt tacccttaga 
atccccttac catttaagta ccattctaac 

40 



50 



60 



caggaaccaa 


atggtggttt 


tcctgctaac 


atttttctcc 


taatggttaa 


aggtagagtt 


tggtctcttg 


tggaggggcg 


cacaaaggga 


aaggactgac 


aatgtgccca 


gtgagaggat 


agaaatacca 


cgtgcttgcc 


agatgaaggg 


cttctgctcc 


tagaaaatca 


caaaaatggc 


ggcatttgct 


aatctttcct 


aacaggacta 


ttgcatacag 


aggataagag 


atatggcagt 


gaaaattgga 


tatcctgttg 


ctgatgccgg 


gaaggcttca 


gataacacca 


gggtgtaacc 


ctcttccagc 


cccatgcaac 


agctaggtac 
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tcaaacacct 


gagttaaagt 


tcggaaagct 


19680 


tggcctagtc 


tccctttatt 


tgccaaaggt 


19740 


tagcacaatt 


ttcatcaggc 


atttcttgta 


19800 


gttttagaga 


tagtgcagct 


ggaggagctg 


19860 


ggagactgga 


agggttgaga 


cactcagtag 


19920 


ttagttttgg 


ggaatcattt 


cctatggaat 


19980 


tgattgtgca 


atgcttgcaa 


gtctgggtta 


20040 


gggatcttgg 


atcatttgcc 


ctcccatctg 


20100 


aaatcaagtc 


tccaccgcag 


tgggccaagt 


20160 


acaatagaat 


atgagatgct 


ttttttttca 


20220 


ctagaatgca 


ctcaagagga 


aattcaggcc 


20280 


aactgagaaa 


aaaaaaaaaa 


aggcctatct 


20340 


gtggaggcga 


agacagtgag 


ggcatccacc 


20400 


cccagcacct 


tgttaaatgt 


gctgcccatg 


20460 


gaaactggag 


gaactaagga 


atgggcttag 


20520 


gcttctgatt 


cccctttgac 


tccctagact 


20580 


gatcttggga 


aaggctatgt 


aatagttaca 


20640 


gtactagatt 


gaattctata 


ttttgctatt 


20700 


cagcgattcg 


ggttaacttc 


tggacataaa 


20760 


taaaggcaga 


agaggtttct 


ttaaaaaacc 


20820 


aggacagtct 


cagaactaaa 


attggaggac 


20880 


tttttccatt 


cacagaagga 


gcataaagcc 


20940 


ggaaattgga 


aaagctagag 


tgtttcagca 


21000 


tcttattcca 


ctgggtgatg 


ctgttgacct 


21060 


tagaaagagt 


acagctcatt 


tcagggagac 


21120 


attcccttga 


gctatattcc 


cagtttatat 


21180 


tttccccaca 


ccataaaaat 


ctgtgcagca 


21240 


cgcagatgga 


aaaggaggaa 


atttgcaata 


21300 


atcaagcaat 


cagaggctgg 


ggtcagtcca 


21360 


ctggcgagaa 


atcctcattt 


gctccaaaac 


21420 


tccatgaaga 


gaaactgatt 


caaaatatgg 


21480 
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ccaacatgcc cagtaacctg ggggattggc 
gacttgagaa cagcaggtgg tttgccctaa 

5 

atttttattt gatttggttt taaaatggtg 
tgaggtccac ttctatactc aggagaagaa 
10 aaagatttct ttcaattcac ccagagtgaa 

caataaagtt aggatagcac aaaaagcact 
tgagaatcaa cttatgataa gcttttaata 

15 

cataagtaat catctctaca tttagcagat 
tatccactag actctccagt aggcagagtc 
20 actaggtctg cttagctccc gtattccata 

ggatccaaag tacaccttct gcctcctagt 
tcccttggct tttcttctta aaggaaagcc 

25 

atgatcttcc taagggtcca tgagggatag 
ctttcagaag ctgactcagt ttattacttc 
30 ctctttctct tctaatgaat gaaaagctag 
cagggcgtgg taggcagcta tagtcctgat 
cgggtcattg gctgagatca gctggacctg 

35 

cagtcctgag cctgtgcact gaggcctctt 
ttccaaatgc caggtacagg agaaacccag 
40 aagacacgta aggacaaagg atgtcaagat 

agtcttcctt acttgacctt ctctccagcc 
aaattgcccc atggacataa ttgtggtgca 

45 

gacttgcgtt tcactaccag tccctaaaag 
gaaagagcct ccctgggtac ttggactcca 
50 cagtgttgca tgtcctttcc cagcttcctc 

tctcttgcag tactcttgga gcttttctcc 
tttttttttg catttcaaga cacatcactt 

55 

tttcaaatcc attgtttttc atgcataaat 
ttgtctgtct ggtcatgcac ttcatccatt 
60 gccttattat agatgttata aagatttatc 
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catggtatct 


ccaggaagcc 


tgtcatctga 


A1 P J ft 

2154 0 


aggcatgtag 


atggccatgg 


gatgcccggg 




gctaagagcc 


tcaaaatgaa 


tggacagaat 


2166 0 


tgggagtgaa 


gaacaattag 


acaaagtcta 


2172 0 


gagaatgggg 


tgggggtgga 


taaaaaagtg 


21780 


ttttgaaatt 


aaagcacttt 


_> _ X_ X— X— X_ X— 

aatattgtct 


2184 0 


X— i_ — - X- . — X- X- _ 

ttctcaatta 


tccttacaaa 


cacc tttcaa 


2 19 0 0 


gaaaaagagg 


ctcagaggga 


tgacttactg 


21960 


gggtattgag 


tggtttgagc 


ccttacagac 


22 02 0 


ccaagagggt 


ttgggcaaca 


tctggatcac 


^\ f>. r\ r\ 

22 08 0 


atgtgctaca 


cctaggccac 


gcacctccaa 


22 14 0 


ctgggtgatg 


gctgataggc 


agaatattcc 


222 0 0 


caacctggag 


— . X- — . _ 1_ X- — * i_ X— _ 

gtcattgtta 


ctctccaagg 


22260 


tcaggctgcc 


aggcaaacaa 


acaaaaaaca 


22 32 0 


aagatgttca 


cttgggccaa 


ggaaaataat 


22380 


tctgtggttt 


tccatattta 


ggttcaacct 


2244 0 


ggtgtcatct 


gatcactttt 


gttacaggac 


22500 


cagttctttc 


tctgaaccag 


taatacaaga 


22560 


tgttggagtt 


ttcttgtccc 


aggtttttaa 


22 62 0 


gtagaacttt 


gatgttcaac 


tttctccttc 


22680 


ctgaaggaaa 


tataccattt 


ccatgccatg 


22740 


ttctgactag 


acacaatcaa 


aacacagcag 


22 800 


accacataac 


agctaccaca 


tattcttact 


22 86 0 


acactcttca 


tgctgatcat 


gtaaaaactg 


2292 0 


tttcccaggg 


act ttcatat 


« X- X_ a. « X. «w X— X- 

cttgactgtt 


22 98 0 


accaacaact 


tgcttttttt 


gttttttgtg 


2304 0 


tttgtttatt 


ttgttatata 


aattatttgc 


2310 0 


gcatggccat 


gtctatttct 


tttagttaaa 


23160 


tatatttttt 


tctgtctttc 


tttctcttgt 


23220 


ccttgttata 


atttggcttt 


gccttttgaa 


23280 
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ttgtttatca gatttttatg 
ttttatatgt gtagaaattg 
5 tttcttattt ttttaatcac 

atagtataag ttatagtata 
gaaataaaat tccaagttcc 

10 

caaaattaac ctgaaaaact 
attatacaat cctttctttg 
15 gacctcaaga ttgatggaac 

acgctgaagc ctgctaccag 
ttaagccaga cactcccttc 

20 

gccagccaga aaatgtttcg 
ttggggcata gcttggtttt 
25 aagaggtaca ttggttctgc 
tggttatagg taggtaagag 
cataggaggc aatcacatct 

30 

ctgcccttgt ccacaaggaa 
tctttatttt aggaatagaa 
35 ttccttttgg cttagtgact 

tacttttgag tttacagtac 
ccacttgttc tattattact 

40 

tttttaaaaa tacactcaac 
catttttttg tcattcagtc 
45 gcattttaat ttctgacagt 
tgggcctggt gcggtggctc 
gatcacctga gatcaggagt 

50 

ataaaaatac aaacattagc 
aggctgaggc aagagaatca 
55 caccactgca ctctagcctg 

aaaataaaat gcttttggct 
aagttccaca agggttttat 

60 

aattgacaac cagaaatatg 



ttaatttaaa 


aatgctgaaa 


tttttggata 


cttccaaact 


ctaaatgaca 


acaaggaaac 


ttggcatttt 


cttatttata 


gtataagtat 


agtaatatga 


ggcaggacct 


aatgtatata 


ccaatcaact 


gatggaccct 


cctcttggcc 


agttcaggcc 


atgatggaaa 


gtgggggtta 


gaattcaggc 


acagcttacc 


agcattaaca 


agtctcttta 


agtccaatag 


gaaacattta 


gagtcttcaa 


taagaacttt 


ggtctccaca 


tattgattcc 


aggtctttag 


ataaactctt 


agctacttat 


gatctgtaac 


ccccatctcc 


atacatttta 


gggagacatg 


agatatcaat 


acagaaaggc 


aggacaactc 


aaagctggga 


acaaatggct 


gcattctttt 


gagtttctga 


gcattaattc 


agtgagcaga 


ggcatgactt 


attccttgtg 


agagaggtat 


gtggcttatc 


tgggaggaag 


gtttgcccta 


agaaagtccc 


tgggggtacc 


aatatatttt 


cctttcacat 


acacacacac 


acacaaacac 


acacacacac 


tactgagtta 


actgtgtttc 


tgtctatggt 


atttatttac 


attcattgta 


gtactacatt 


tatttcaaca 


caagtaacac 


atcacttgta 


acaatcctta 


cttcaacact 


actctttatt 


atgcctgtat 


atctagcact 


ttgggaggcc 


tcaagaccag 


cctggccaac 


atggtgaaac 


cgggcatggt 


ggtgcgtgcc 


tgtaatccca 


cttgaaccta 


ggaggtggac 


gttgcagtga 


ggtggcagaa 


tgagactcca 


tctcaaaaat 


agtaagactc 


attccacatg 


aacataaaaa 


ttggaagcac 


attatattac 


atctgaaaaa 


tcagtacagt 


caagtgtcca 


tttgccttcc 
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attaaacata 23 34 0 

agacatcttt 234 0 0 

agttatagtt 23460 

ttttaaatca 23520 

aagggcattc 2 3 5 8 0 

gacacgccgc 23640 

ttaaaacaga 23700 

caatctattc 23760 

attccttatc 23820 

tcaatcaatt 23 88 0 

agcccccact 23940 

caatatttgt 24000 

ggggtcttcc 24060 

ttagcctttc 24120 

tgagttctgt 24180 

ttagtaggta 24240 

tgcttgactt 24300 

ttctctcaga 24360 

gcacatttaa 2442 0 

ttaaaatata 24480 

tgtctagtct 24540 

ttagtacact 24600 

aaaatgattt 24 66 0 

aaggtgggca 2472 0 

gccatctcta 247 8 0 

gctactcagg 24 840 

gccgagatca 24900 

ataaataaat 24960 

tcattaagtc 25020 

aatttggaat 25080 

taaataaaaa 25140 
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atgtgtatct tttatcatat aaattcttca tgttttcaat taggtgaatt accaaatatt 25200 

tcatattttg ttttaatatg aacactttag atgtttaacc tggttattaa caattaaaaa 25260 

5 

tctataacta ttcattgggc ttgaaattaa tcttactttg gtaacaaata tttgtattat 2532 0 

ttacttctag ttttattgca attgattagt taacatggcc tgcaattttt gttttattta 25380 

10 tttggaatgg atatatggcc aattgttgcc aatgtaccaa ggattcttaa aaagcatgct 25440 

ttctttagtg ttgattcatt caagtatttt ataattcaaa tcatattttt tggcttacat 25500 

tcatcaaaac ttatatttgt taaaaatttt catgattact atgtttttgt tagtttctca 25560 

15 

tactcaggca cattttttat tagtagtata ttatattatt tgatcataaa attgtaaaca 25620 

acatattatt ataaatgaat tcctttgtca tatttaagaa ttttccttag acattttatt 25680 

20 gcctaatatt aatacaccat cctttgtctt attattttgc attttctgac ctttatttgc 25740 

ctttccttac ccttacattc actactataa taggaacctt tggaatagaa atgggatttt 25800 

gtcactgtgc tgcttatatc cttctgtggc tacacattgt ctctatgaca aagcccaatg 25860 

25 

tctccaggaa gcattcaaag ccctgtaaat ctctcatttt gcacatccag gatcttatct 25920 

ctctcccaac tgaatctgtc acttctccct ctgtgataga ataataccaa agtgtttcca 25980 

30 ggtctgctag taagccaaac tgtttcagaa ctctagtctc tgcacattga tccctctacc 26040 

gagaatgcat atggctacaa gcacctgctg ttctttgaaa gcatgatttg aaagttaatt 26100 

ttcaatggag cctcctcttc caagagaagt ttaatatctg acctttgtgc tcccatgatg 26160 

35 

ctccctatgt actgtgtgac ctttgtgctc ccatgatgct ccctatgtac tgtgttcaag 2 6220 

ggcagggaca ttatcttcca caggtattgt tagtctctag cagtgttgca tgcaccagat 26280 

40 acttaataaa tattggctga ataaaataat tctgaatggt agctcatttg gactagaaac 26340 

ccaggataac tctagaaagg tgggacaaaa atggcagtct tgttcacata tacagtggct 2 6400 

gtcacttagg agtccctccc gtgtgaagca gagaattgtt gctgtacaaa tctgttttca 26460 

45 

aacatgcctt catggaaaga gagtaacagc aacatggaaa aatatgtgaa tttggaacaa 26520 

gagcaaaata agatgacaga gcaatagtag acagtgtaat tacttgatat caatgctagg 26580 

50 cagtgtatta gcatatttct tttgcacaat tgggtgcaca atagtagcat taaatgccgc 26640 

agcctgtgat gtattcatca agactcctca actcagtatc aatttcctac agcctagaaa 26700 

aactgtttgt ctaagtattc attaggaaga gccacaattt ttggtgggag gtgagctaaa 26760 

55 

ctttttactt ggttgtgtca gtgtttttca tatattaatc cagataaaga tttctattgc 26820 

ctctctccaa atcatttttc cattattttt ccaaagaatg gcctgctttc ttaatttttc 26880 

60 cctcctcatt ttccatcctt catttccagg gccaagagta aagggttttg atcatttact 26940 
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gatattgttt gtctgtgtcc ccacccaaat 
cacgtttcgt gggaggaacc cagtgggagg 
5 cactgttctc ctgatagtga atgagtctca 

tccctgcaca atctctcttc ttttgtctgc 
ccatgtttgt gaggcctctc cagacatgtg 

10 

tgtaaattgc ctagtctcag gtatgttttt 
acctatctaa attgcaatct gaattggaga 
15 attttttgtg tttctaaatg gaaatggagg 
agaaaatcta aacaaaatat gtaggctaat 
ttcaaaaaat aggaactact ggtttagatg 

20 

tcctgtgtac acctgtgcag tgtattagct 
taggcctatt tcccacctct tagcaaaaat 
25 ttatcatcct gcagtttgga agaatagtgt 

aatcattctg acgaggccat gtaaggttat 
aaagtggggg tgaaaagcac tgattgactt 

30 

ggcggactca gtcctcagag ttacatggat 
gaggaaaatg gaagatggat tctgggcagg 
35 gtctgcccta tgcccaaaaa cactttcttc 

acatgatagg cat t cage tt ttcccaagag 
cactgeattg gaatatgaag ggagctctgt 

40 

atgtgtttct tggecagaca attgagecac 
tggtccattg ttctctacaa ccacatccag 
45 agcccactta atatttggta caaatttgag 

aatccatagg gcaattcttg ggtcttccac 
ttgtcagtct aatagaataa attacaaaaa 

50 

gaaaaatata ccatgaaaca tttaaaattt 
acagaatact atttaaaata aatattaaat 
55 atacatataa aattaaataa atgactaaat 
tttgtgatac taaaaaaaag acaattgaaa 
aaaagggaat gctcacaaat ttattttaaa 

60 

aagaaatggg aaaattttca gatatttttc 
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ctcatcttga 


attgtaactg 


ccacaattcc 


27000 


tgattgaatt 


atgggggcgg 


gtctttcctg 


27060 


tgagatgtga 


tggttttaaa 


aatgggagtt 


27120 


ctccttgtga 


gatgtgcctt 


ccactttctg 


27180 


gaactgtaag 


tccaatacac 


ctttttcttt 


27240 


acagcagtgt 


gaaaatggac 


taatacattt 


27300 


aacagtggtc 


tttgtgacat 


taaataaatt 


27360 


atgaaaaggg 


agaagtacat 


tatcatacag 


27420 


gtatctgaaa 


actacagagt 


tgaagcaaac 


27480 


tttagatgtc 


atggagactc 


cagcccatac 


27540 


ctaccctcta 


ctgtgtgttg 


gettaatget 


27600 


ttccacatca 


agtattggcc 


ttacatccat 


27660 


tgtctgaagt 


ttcaaaacaa 


gactattgat 


27720 


atgtccacat 


tggttcaatc 


attgeaatgg 


27780 


agectagage 


cagagcacag 


tggaggcaag 


27840 


ccccagtcag 


aattcacaga 


ctgtttgaaa 


27900 


caacaagcgc 


atgetacagg 


tcactccttt 


27960 


tgatccatac 


ttccaacaat 


gatcacagct 


28020 


gacatcaagc 


cagaaatctc 


atgcacttaa 


28080 


ctaaacaggg 


aactctgtct 


caatagtaac 


28140 


ctccaggtat 


gttctccagg 


aagattgect 


28200 


catgggaact 


aaaaaattgc 


agcttctagg 


28260 


ggcctagatg 


ccaatcatct 


atagttgttt 


28320 


caaggaataa 


gaaggaaatt 


aaacaaaatt 


28380 


tttaaaaaat 


taaagcatgg 


tgagtagaaa 


28440 


gatatattag 


taatcaaaat 


aaaattgaat 


28500 


atattaataa 


ttagaagaca 


tatttaaaat 


28560 


aaataaaaca 


ctatgccctg 


agaagtgagt 


28620 


tattattcaa 


aatat tcaga 


agattttttt 


28680 


agatcttgat 


ctaaaccaaa 


ccagagcaat 


28740 


aatctctgaa 


aaatcatatt 


taaaacacaa 


28800 
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ggaaatataa tcccacagtg taacatatat cattcccaag gtgtaattat actgagaatg 28860 

tatgggtgtt ttaataatat ttcactgcgg acttcagagg ttagcatgat aaagaaggca 28920 

5 

atcctgtaga acttagtgag ggagcagggc accacgtggc cacatcactg tagaccctgt 28980 

ccctcacagt gacctgaaaa cagccaccac caatacagca gagcgcacgc acctaagctc 29040 

10 ccaggtatct gccatgcaag gccacctaag cccctagcta tctgccatgc agagatgggt 2 9100 

tatggtttac agacagtccc acccctgaag agagctctgt ccttctgcta tactgatggt 29160 

actttggtgg actcagacaa aggcactgcc caacaaggac tagacaccct gttgatccac 29220 

15 

atgcagcaac acagctaggc ctttaacaca gacaagatac aaggacctaa tgatacccaa 29280 

atgaaactaa taaaactcct gggcataatt cagaaaggag caaagcctct cattgcaaag 29340 

20 atgattgtca agattttgac cttttttttt ccctccccac tccctcaaaa aaaaaaaacc 29400 

cccaaaagag cagcaatttg tagacctatt tggctactgg cagcaatccc aaatacccca 29460 

cgtagatatc ctgaaagtat tcatctactg gcttacaata aaggccacta ctatggagtt 29520 

25 

ggaccccgac caacaggaca ctcaggagga cctccaaagg gccacccacc ccacccttct 29580 

cctggacccc attgactcca tttgctgttt gaatcacagg tctcagcaac ctctcagttc 29640 

30 actgagtgga gcctatggca aaagaacaca caggccatcg ggctttggat tcataaacgt 29700 

aactccccac cccttaagtc tgggctgtga atagtgactt cctttcgaag aacacagaat 29760 

gtaaagggga ataaaaagag tagatttaca gtgaagaagc ctgacataca ctacctcaat 29820 

35 

caagtgacta agataacagt ggtaagtcat gttgatagtg tggactgttg atgggatgtg 29880 

atgatactgg ctctttacat ctgtgatctc cttccacaaa acccaaacac cagtctaatc 29940 

40 tgagaaaaat atcagataaa ccccaactga gggagtttct acaaaacacc tgaccagtac 30000 

tcctcaaaac tcttaacgtt ataaaacaca aggaaagtat gaaaaactgc cacagtctag 30060 

aggggcctaa ggagatggac acctgactgg aatgctgcac cttggatggg atcctggaac 30120 

45 

agaaaaagga cattagatca aaatctaagg aaatctgaat aggacagtat gaacgtcagt 30180 

tatcaatgtt tgttcattaa tggtgtatca tactaatgta agatgttcat aattggagaa 30240 

50 actgagtgca gggcacagga gaactctctg ccatcttcac aactcttctg taaatctaaa 30300 

acttttctag tataaaagtt tatttcaaat tgtttaattt ctcttttgtt tttcactctg 30360 

tattctgcta aaccatgttg tccaacagag ctcactaaaa tggcacatgg tgaattcaca 30420 

55 

aaataagatg aactctccaa ttttacatta tcaaccatat acacacacaa tttgtataca 30480 

tacacagttt ggtgtaaatt ctcgttcaat accaggttcc caggaagaaa taaaataaaa 30540 

60 acttaaactc ctttactttg ggagttaagc aaacactctc aaaattattc tgcccccatt 30600 
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tttctgtttt gctacatttt gttcagagat 
attttttttt tattggcatt attccaactt 
5 agagaagttg aaaccaaaat caagcccctt 

tagacctcat acttaatcta gacctatgat 
ggagtatcta aagaaaggag ggtagatgga 

10 

gatgacaata aactccagat ttccagaatc 
aattcaggca ccttttcatg ctgctttcat 
15 tatgttccct ctcagagcat tagagcctta 

ctctgtaaaa tatttgaaga gatttattct 
cacagccttc aggaggtcct gagaacatgt 

20 

tatacattct agggaggcat gagacatcaa 
gtccagaaag gtgggacaat ttgaagcagg 
25 aggtaaattt aaacattttc tggttgataa 
aatagaaagg aatgtctggg ttgagataag 
gatgaagcct tcaggtagca ggcttcagag 

30 

cttttttttt tctttttgag atggagtctc 
acaatctggg ctcactgcaa cgtccacatc 
35 tcccaagtag ctgcgactac aggtgcatgc 
aaggtttcac catgttggcc aggctggtct 
cttggcctcc caaagtactg ggattacagg 

40 

aaatgtttct tatcagactt aaagtctgtg 
gcatgtctca accacacttc ccatcatggc 
45 agagccctgg ctgaggagga agtccattca 

gtttacattg tggtactttt gcaaagtttg 
tcactcttga ggcacttctg tcccattggc 

50 

tctctggtct ctgtgttcct ggcatcctca 
acaaaagtcc tttgccacag tagctatggc 
55 ttctaaagca aaccttacag gcacaatttt 

gaacacctct gagagaaagg ctgaaccaaa 
ccttggaggc tcaggacagc tagtcacggt 

60 

ctcctcagcc tttgaactgt caggcctgag 
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gtttccttca 


gattctgcat 


ggggttgggg 


30660 


ttcttagtac 


ccgtttcaca 


tttgcaaaat 


30720 


aaacaattag 


cccagggaaa 


attacgtttc 


30780 


attatttcca 


aacaccaaag 


cagtaactca 


30840 


gaacaagaaa 


ccttgatttt 


cagcctcaag 


30900 


aatgatttgt 


cttaaaatca 


gttgacgctg 


30960 


tggaggatgg 


agaaatccac 


gtgtgcatat 


31020 


gagacattat 


tggtgataaa 


aagtgtcaaa 


31080 


gagcctaata 


tgagtgacca 


tgacccacga 


31140 


gctcaaggct 


gttggggtgc 


agcttgcttt 


31200 


tcttatacat 


ttaaaatata 


tattggtttg 


31260 


aagtgagtga 


tggggcagct 


tccgggctat 


31320 


ttggtcgaat 


ttatttaaag 


acctggaatc 


31380 


aggctatgga 


gaccaaagtt 


ttatcatgca 


31440 


ataattggtt 


gtaaaatatt 


tcttttcttt 


31500 


cctctgtcac 


ccaggctgaa 


atgcagtggc 


31560 


ctgagttaaa 


gcgattctcc 


tgcctcagcc 


31620 


catcacacct 


cgctaatttt 


tagtagagac 


31680 


ccaactcctg 


acctaaagtg 


atccacatgc 


31740 


cgtgagccac 


cactcccagc 


catggctgta 


31800 


ttgatgttaa 


tgctggagag 


gtataaagag 


31860 


ctgaaccagt 


ctttcaggtt 


acattttaaa 


31920 


gatacttggg 


gccttagaat 


tttatttttg 


31980 


gatatttgat 


ggtggtccca 


ggtgtcaaag 


32040 


cttgctcaaa 


acctcctgat 


gtttttttcc 


32100 


tgttgcatca 


agtgaaatct 


gtctctagtg 


32160 


cactatttct 


ttcaatgcac 


tcacttaggc 


32220 


cctcttgtcc 


tcactgtaat 


ttcagccaaa 


32280 


gctgctatga 


gggcctaggg 


agccagtggc 


32340 


gccactgtgc 


tccattctag 


aactgagcat 


32400 


tgtggtgctt 


catacgaaaa 


tgctgtagta 


32460 
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cttcagagaa acagaaatgt ttcttttctt 
tgaccctaga attctgcatc cttttaaagc 

5 

ctgctaacaa ttcatctgct tcttaaatgg 
gatgtggttc attgattaca ctatattctg 
10 gttttgcttt ctggcctgga catttttgtc 

aagatgctta gtgttcccaa agcatccatt 
cagtgtccca tgacccccct aaatgacata 

15 

ggacttgatt gttgacttca aaccacccaa 
tttcttggcc tctttgccag ctgaagacct 
20 cttgggccca ggcttactgc aggagatgcc 

tcttgcactc cagcacagat cccatggcca 
gggggtgtgt gagtaagcaa gtgtggggtt 

25 

gtgggctctg tgcagggctt gcagctggac 
actctggcat tcagacaaga ggaacatggt 
30 aagccccaga gggggtgttg ccacattcta 
cagtgaacag cggcatgtta acagctctgt 
tctggggcta gctcggtcct gctgctactt 

35 

gcgaagggga aagggccaca gtgttacagc 
agttattgtc ccacgttcaa gaagaatgag 
40 tggggaagaa ttttattaag tgacaaaaca 

gtcccccacc caaagtcagg tggtctctct 
gagtcccagg cttttatggg ctcagaatgg 

45 

gcagaaaagg ctaaaacaaa gacaccactc 
ttatggaagt gtaggtacat gtaaaacagg 
50 gagaaaagca tgccaaatgg gaagagaggt 

ttgtagcttg gctttcaggc tttaaactct 
cctgcccctg tctgcctacg attgtctacc 

55 

caagtacttg ctttaattta tttttcaaag 
gaactaaggg actaggaata taaataatac 
60 cttgcaaaat aagatacagc aaatgctcag 
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tggagcattt 


atttctgcag 


gatttatata 


32520 


attcccacat 


tttggaaggc 


ttttgctaag 


32580 


atccatttcc 


atatctgttg 


atgtaaagtt 


32640 


ttggtttgtt 


ttttgtttgt 


ttgtttgttt 


32700 


ttcctaacat 


acctcttaca 


ttgattcttc 


32760 


gttcaaacaa 


ggcttgtcca 


cttcatctcc 


32820 


ctactgtgtt 


accagtgcgg 


aaccaacact 


32880 


tcaaggctac 


aacagatgta 


acacaggatt 


32940 


ccagccagtg 


acacccctgc 


ccaggctttg 


33000 


ctatctactc 


agcctggcag 


gccacacctg 


33060 


ctgcaactgt 


gcaataagcc 


tctggtggga 


33120 


gggccagcta 


tttcaatcac 


cggcacagga 


33180 


caggcatgtc 


acaagcaact 


cccgcagtgg 


33240 


ggtacccagg 


cagggatgcc 


catatccctg 


33300 


atagctcttt 


tagttctgcc 


atctgcagcc 


33360 


cagtcccatt 


gccctaatcc 


tgcccatggc 


33420 


cctgttgcat 


ggggtggctg 


ccctccactg 


33480 


tttttttgta 


ctcacattca 


gtgggtcctg 


33540 


gttatgttga 


caattgaagg 


gtgaggaggg 


33600 


actctcagca 


gagagggggt 


tacaaggatg 


33660 


cacccagagg 


tgggcagttc 


ccatgtggct 


33720 


aggagtgcag 


gctgattagt 


ttgtgagtat 


33780 


aaaagtgggc 


atgacaatgt 


aaaaggccaa 


33840 


ggaagggtgg 


tgaagggtgg 


ggaccaatca 


33900 


tctccatcca 


gcctcagatt 


tatctgagac 


33960 


ctttgggtta 


aaggtggtgt 


tcaccggact 


34020 


tcctggcgct 


gtcagatgca 


tcaatctccc 


34080 


ttcctagtct 


ttgaacatat 


gtttttagga 


34140 


ccctttcttc 


tagtcccata 


tttacagtga 


34200 


gtgaaaaacc 


aaaataccac 


tttattcctg 


34260 



-212- 



WO 02/062946 





ci Let Ly l. Uyaa 


nn^ ai aj aj rrpah 
yyctctctciyLciL 


t*rrff raat t 

l_ V—. U. O V, C4. L- 




a ^ ^ pt /i q 
CdaCCayLCa 




t-nnnt r^ppp 


5 


aaaccctagc 


aagggaaggg 


aagaacagat 




LactactLctctctL 


yctctya.ctciL»ciL- 


cty ct uyyy^>-y 


10 


CCCCgCaaCL 


tty cycay^d 


L L *wL LL L> L UL • 




yaaat-ayy Ly 




y y zj l 




atyaayCL CL- 


4— /^-i f - 3 r3 f~ (T 

L LLL LaaL l,y 


uyyL l a l. l l l 
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ttctgccgca 


gcttcaaaag 


ccaaggtaga 
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55 


tatatatcca 


gcacttagaa 


caaagectga 




cctttccttc 


ctcttccatc 


tttccctgcc 


60 


agagctagac 


taccct tcca 


tagecagaca 




ttccatagcc 


agacaccctc 


ctcacttcct 
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caaggecaaa 


tgaaattcaa 


ttgatgtgtc 


34320 


caggect ccc 


ccataagagg 


aggcacctcc 


34380 




tctagcaggc 


agactgtctc 


34440 


agccacacaa 


gcccaagccc 


tccttccaaa 


34500 


cctoaaoaoa 


accaggt taa 


qqaaaoqaaa 


34560 


t ac t craaaQQ 


aaatataaacr 


ctagaqacacr 


34620 


tacrciciciacjcfa 




ggaggagaca 


34680 


tatttctgta 


gagaggaaaa 


aagtgaatct 


34740 


y aa l a^ uyy 


nnaa aatata 


t agatacctg 


34800 


crt" a t - aaaatt 


t t aaaatt ao 


catataaaca 


34860 


cacctttcta 


t atatctgat 


tctttcagaa 


34920 


aagt ct cct a 


tgaacctcaa 


agttatttta 


34980 


tctgaaatgt 


ataaaagtcc 


ctqaqtqqca 


35040 


cttcttgcct 


tcctcctttt 


ttgtgtgttt 


35100 


ctcttttc ag 


gacaaaatgt 


caaatqtaaq 


35160 


aaacaataqq 


aaggaaaact 


t tetgectaa 


35220 


ttcttctcaa 


acaaatggaa 


agcaattgaa 


35280 


tatttcccaa 


cagttataaa 


caacccagtt 


35340 


accatgagtg 


tttcccaaac 


tttggaaaca 


35400 


ctattactoa 


cattctactg 


cagtegtcag 


35460 


ttaaaaoatt 


tatttttatt 


tctaaggaca 


35520 


taaaaataac 

L-» VJ C*. V-*- W*. C* t*4- 


aattggccat 


ctgcatttgt 


35580 


aaacctacta 


atacaaacaa 


atctaatctt 


35640 


ctcagggtca 


gegtaggetc 


agaaaggtca 


35700 


aacfcactcta 


cccagaagcc 


tatttacttc 


35760 


ct t cct taac 


tttttQQQQt 


cttcctctct 


35820 


aotacrcrat 


aotaacaocia 


tgagaaaata 


35880 


caatataatc 


t catcaagac 


t taagagctt 


35940 


tcgcttccct 


agatgataga 


gctagatgat 


36000 


ccagagctag 


atgatagagc 


tagaccaccc 


36060 


ccctccaaca 


cttgcccaca 


ggcacaccta 


36120 
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gaatcctcaa gaccactcaa gtggcttcct ggtaccactc ttgctagacc aaagaaaatg 36180 

gtaagatgtg gagggaatta tgcactcgat tcttcatatc ttcgatagtg ttaaatggtc 36240 

5 

catatgcttt cttatagctg ggggcacatg catttacaca aacacatgca cacacacaaa 36300 

acacccttac ataagtgtgt gctgggggaa agctatgcaa ctctgaggct ccaaccactt 36360 

10 tttgtgtact tttcacaatt gcccagcaga tggcagcaaa gtgtcctgag gaactacttt 36420 

ttctgtataa ttcagcccta tgtggcagtg taaagctaga ggctaaattg tccctctaga 36480 

tatataacag agcatagaca ggagaggcat tctgttcagt ttcatctgtg gcccgaagac 36540 

15 

aagagatgac acagaattgt gccaaggggc aggacagaag gaggagtcag cccccggagc 36600 

aggtaattat tcaggtagaa tttagatctt tttcgttgtt gttgctcctg agagcagtat 36660 

20 tctgttacca atactgttta gaattgttta aaaatggatg ctaaaaagca tatggcagtt 36720 

tgttattgtt gtaaaatttt ctacagacca tgcttctcct cattgtctgc acctgaagag 36780 

accacaccca ccacccgttc ttgttttgcc atagagatga caatcgctcc taatttacac 36840 

25 

aataaaaccc cattcaccca agtagaaatt aagaaaaaag agccaaacat ggttttctca 36900 

accagattat aaaacattaa agattgagta aaagagtaag aaattgagta agactgaagg 36960 

30 aagaggattt cattagaaac agaatgtcat ttggaaactg tactggagaa tttcgttaga 3702 0 

aggacattgt tgatgttttg tgtggtttcc tccctcctgt cactatagga gaaggaagga 37080 

ggaacctccc cctcatctcc agctcagaga gagaggctgg aatgggaccg catggggatc 3714 0 

35 

atgttgtaga tcctctgcag tttagggaaa atgagaaact tgagtccaac acaccaggag 37200 

aaaccaaaga atgctaagac ctggactgtc tactctatag ccagtgtttt tatctattgc 37260 

40 tatataataa atcacccaaa cgtagaggtt taacacagca cacctgttac ctcacaggtc 37320 

gggactctgg gcatgactta actgagttct ttgcttcaag gtctcaccaa gttccattta 37380 

aggtattggc agggatgagg cctcaaatga ggtttgacta gagaaggatc cacttccaag 37440 

45 

cttacaatgt tattgctaga attcagttcc ttgcagttgt aggataaatg gtctcatttc 37500 

ttgccaggtg ttggctcgag gtctttgtag gatattggct agagaccact ctcaattcct 37560 

50 aagggctgct tatactctct taccatatca agactgcaac atgacaaatt ggtttctcaa 37620 

ggccaacaat ggtgagaatg agacaccagt aagacagagt ttacagtctt ctgtgacatg 37680 

atcatataat cacataccca aaatcacata catcctgtca cctttgcagg attctggtta 37740 

55 

ttcaaagcaa gtcacagatc ctgcctatat ttacagggaa ataactacaa aagggcatga 37800 

agaccaagag gcaggaatca tctagccacc ctagagtggc tctcatggtg gcacaattaa 37860 

60 gcaagaggag tagattgtgt gggggtggtc tgcacttcca cctgtatgct gtgtgaaaga 37920 
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gattctacat ggatggtctt agagcttgct ctgaaaagaa atcaggaggg agcaaggaga 37980 

ccttgaagag aatgaggctg gagattttct gatggacatc cagggacagg acataggata 38040 

atcgtcagct gccagctaag tccatgagga tactgagagg aagcccagcc aaggaatccc 38100 

agtgatggaa ccacagatgc attcaacatc caccagcctg catagcactg acatcgagtt 38160 

gggaaacggc caatcgagag agactttttc tgttcctctc tcctcattcc tgcccctgac 38220 

tctgtgagga tcaggaatcc ctcttagcaa tcctcagggt gaagagttag agaagaaata 38280 

gcactgaccc ccactttaca tgtcagctgc cctgcctaaa gcaggccaaa ctgaggaaaa 38340 

gaagaaattt taatatgaat gacagttgga ttttgatttt tgcataaatc tatattttaa 38400 

ttactaaagt gactgaggaa ctcttctatt agctgacagt agtaacatct tagatttcaa 38460 

acaagcagca ggggagagct agccctactg aggatcttag aaagaacagt tgtggggcca 3852 0 

ggcacggtgg ctcatgcctg taatcccagc actttgggag gccgtggcgg acgaatcgcc 38580 

tgagttcagg agttcgtgat cagcctggcc aacatggtga aaccctgtct ttactaaaaa 38640 

taaaaaaatt agctgggcat ggtagtaggc gcgtgtaatc ccagctactc agaaggctga 3 8700 

ggcaggagaa tcaccaggct ggtctcaaac tcctga 3 873 6 

<210> 79 

<211> 104644 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . . (104644) 

<223> n is an undetermined nucleotide (dATP , dCTP, dGTP , or dTTP) 
<220> 

<221> MS4A7_initial_coding_region 

<222> (17493) . . (17639) 



<220> 

<221> MS4A12__initial_coding__region 



-215- 



WO 02/062946 

<222> (91488) . . (91760) 
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<220> 

<221> MS4A5_initial_coding__region 

<222> (64028) . . (64180) 

<220> 

<221> MS4A7_coding_region 

<222> (19439) . . (19573) 

<220> 

<221> MS4A5_coding__region 

<222> (65149) . . (65277) 

<220> 

<2 21> MS4A5_coding_region 

<222> (66764) . . (66820) 

<220> 

<221> MS4A5_coding_region 

<222> (68118) . . (68270) 

<220> 

<2 21> MS4A5_coding_region 

<222> (82002) . . (82109) 

<220> 

<221> MS4A12_coding_region 

<222> (95210) . . (95347) 
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<220> 

<2 21> MS4A12_coding_region 

<222> (96148) . . (96204) 

<220> 

<221> MS4A12_coding_region 

<222> (97865) . . (97981) 

<220> 

<221> MS4A12__coding_region 

<222> (100890) . . (101000) 

<220> 

<221> MS4A12_coding__region 

<222> (101182) . . (101283) 

<220> 

< 2 2 1 > MS4 A7_coding_reg ion 

<222> (21068) . . (21124) 

<220> 

<221> MS4A7_coding_region 

<222> (23741) . . (23947) 

<220> 

<2 21> MS4A7_coding_region 

<222> (27037) . . (27138) 

<220> 

<22 1> MS4A7_coding__region 
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<222> (28139) . . (28210) 



5 <400> 79 



15 



25 



nnncttgagt 


caggatgtgg 


cttttcctta 


agagaaaggn 


tgntacggag 


agaagcaaag 


ccaaggtttg 


tgggaaaagt 


ggntacagtt 


gagagaagca 


cagggtctaa 


gcccatgtgc 


tngaaggcat 


atgcttgtca 


tattgaacaa 


tgtcaattga 


aagcttattt 


tatgttcgcg 


tncagtttgc 


ttggcctttt 


aaagtgctgg 


tgaaactgct 


tttttgtgct 


gagtcagttc 


agtttattga 


tctgggtggt 


gccagctaat 


tgctcttagg 


agcagtttag 


gaagggtcag 


aaaccataat 


ttctaatctt 


gtgtctaatg 



gcaagaaaga ggtctgcttt gggaaagggc 
30 taaactaagt ttctcccaaa gttagttcag 

ggaggttaga agcaaaatgg agtcagttaa 
ttttgcaaag gcggtttcat atcggattca 

35 

taaagaggaa ggcattttgt atcaataaaa 
tgagacggag tctcgctgtg tcgcccaagc 
40 gcaagctccg cctcccgggt tcaggccatt 

ctaaaggcac ccgccaccgt gcccggctaa 
gtttcaccca tgttagccag gatggaattt 

45 

attttaacca acgttaataa tctaaatacc 
taacaaaata ttatatatac cccataaata 
50 atctcaaatc acatgtgtat tgggtcagat 

aagatcttac acttgtgtta aagaacaaat 
aaattgaaaa gctgagttta tgtgtttgtg 

55 

ctctttccct gtggggaaat tcattaagta 
ttcaaactga atagaaagag gtttcataat 
60 agtcttcagt ctttaggctg ggaaagtgag 
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aatcacttnt 


atgggtgtgt 


ggcagggaag 


60 


gctttatagc 


tgcttcttcc 


ctttctcctt 


120 


tgataatatt 


tctgtttgtg 


aatgggtgga 


180 


ttttgcttca 


tgccttgtca 


nttaggaaaa 


240 


atattgattg 


tcaattcaag 


atcaattgct 


300 


caggtcttaa 


tagataggtt 


ttccaggtga 


360 


gattataaac 


atgagccacc 


tcacccaccc 


420 


ctaggtgggg 


gccacaagat 


cagatgagcc 


480 


ccatcggggt 


ctgcaaaata 


tctcaagcac 


540 


aatcttgtag 


cctccagctg 


catgactcct 


600 


ttagtcctac 


aaaagtaatc 


tagtccccag 


660 


tgttactgtc 


tttgtttaaa 


ctgtaaacta 


720 


cctactccca 


ggaatgaaca 


aggacagctt 


780 


gttaaatgtc 


tttcactgtc 


tcagtcataa 


840 


gtgatgaaaa 


gattttatct 


gaacaccact 


900 


taatttttca 


aaaaaaaatt 


tttttttttt 


960 


tggagtgtag 


tggcgcggtc 


tcggcttact 


1020 


ctcctgcctc 


agcctccaga 


gtagctggga 


1080 


ttttttgtgt 


gtgtttttag 


tagagacggg 


1140 


ttttaaattt 


taaccaacat 


tatttttaaa 


1200 


ctgacttgat 


cattacacat 


tctatgcatg 


12 60 


tgtacaaaga 


tcatgtatca 


attaaaaaga 


1320 


gttggtaaaa 


tctgaatccc 


attttctgac 


1380 


taaagatcac 


tagagctata 


aatagaatat 


1440 


tatgtgccca 


gtaaggaaag 


agccccgtgg 


1500 


ttcaccagct 


gggggaaatt 


caggggtttt 


1560 


agttatgcaa 


attgaagact 


gaagactagc 


1620 


ttgtttaaaa 


caagtcctgt 


tgtttataat 


1680 
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cacttaccag tttcggtcaa ctgaatcagc 
tgtttcctaa aaatattttc cttttatact 
5 aaaacttgag caatacattt tgttttctaa 

tgtgtgtatg tgtaaaaatg aaaaataaaa 
aaaaagatgg caaagttgtt tcattcccat 

10 

ttagaaattt gttcaagtca taaaggtatc 
taggccaact aagaattcct aagcctagct 
15 caggtcttgt aactcagctc atacacgacc 
acaaattagg ctaacagcag gaagtaaaga 
cagtgggagg gacaatgatc gggatataaa 

20 

cctttgagtc ccctcccttg taagggagct 
attgctcttc tggtccgtgt ttgttctggc 
25 tgcagtttgc cgccgtcgca gaaccaccgc 
gtcccctgcg ctcctgatcc agcgaggcgc 
tgcacggcta agtgcccggg ttcgtcctaa 

30 

agttctcttc cgtcatccac ggcttccaat 
cttccattcc ttggaatccg tgaggccaag 
35 gccatctttg gagcagccca ccaccatctt 

aatctttgag taagttcaca attgggttat 
gataaatcca cacactccaa tagaaattat 

40 

gttaggcatg gtgtccccat ccaaatctca 
agagacctag tgggaggtga ttagattatg 
45 atcatgagtg aattctcata agatctgatg 

tcacacactg tctcctgcca ccatgtgaag 
atgattgtgt tgtaaatttc ctgaggcttc 

50 

acctctttcc tttataaatt acacagtctc 
agtaatataa tagtcaacat aatttgtctt 
55 aaaagaatga ttactttatt gagaagtggg 
agccttcttt gtatataaaa atggcagtgt 
gaattttgct ggaatttgtg ggaatgggct 

60 

cccttctact gctctcctct caccaccgtc 
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tgcagtgaga 


aaaaacattc 


agttttgatt 


1740 


caaaattagt 


tgctaaccaa 


aattaaagaa 


1800 


atgccactaa 


aattttatat 


gatataggag 


1860 


tgtttccaaa 


tgcgttttag 


taatatcaac 


1920 


tctctgctac 


catctggttt 


aagaaaatac 


1980 


attttgagag 


acaggactag 


ctggatttcc 


2040 


ggagaaggtg 


acggtaccca 


cctttaaaca 


2100 


aatcaggtag 


taaagagggc 


tcactaaaat 


2160 


aatagtcaaa 


tcatatattg 


cctgagagca 


2220 


cccaggcatt 


cgagcaggga 


gcggcaaccc 


2280 


ctgttttcat 


tctattaaat 


cttgcaactg 


2340 


tggagcagag 


ctatcgcttg 


cgtccaccac 


2400 


tgacttctac 


ccctccggat 


ctggcagggt 


2460 


ccattgcccc 


taaaggctcg 


ctattgttcc 


2520 


tcaggctgaa 


cactagtcac 


caggttccac 


2580 


agagctataa 


cactcacagc 


atggcccaag 


2640 


aaccccaggt 


cagagaacaa 


aaggcttgct 


2700 


gggagctcta 


agaacaaata 


ttttcagata 


2760 


ctgaaatatg 


caaggaatcc 


tgttcaccaa 


2820 


cttgcaaatt 


caatgtcgat 


agttgacatg 


2880 


tcttgaattg 


taatccccaa 


gtgtttaggg 


2940 


gggacagttt 


cccctatgcc 


cattcttgtg 


3000 


gttttctaag 


cagtagtttt 


tcctgcactc 


3060 


aagtcccttg 


cttccccttt 


gccttccgcc 


3120 


cccagccacg 


tggaactatg 


ggtcaattaa 


3180 


aggtatttct 


ttagagcagt 


gtgaaaacag 


3240 


tttattatct 


gatctcccca 


tttttcttat 


3300 


actctgcaca 


attaagagat 


gatatttccc 


3360 


gacatagctc 


tcgctaatga 


gacctaagaa 


3420 


aatttcttat 


ttaggcatag 


cccttcctct 


3480 


cttcttttcc 


ttcttgttct 


tcatagtctt 


3540 
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5 



15 



tcaaaagaaa 


aggcaaatag 


aaagctgaca 


cccatggaga 


tgaaagctca 


ccttatggag 


acactgatga 


tactgaaatg 


gccattgcaa 


agagatctga 


cctaaccaac 


ttcaccttgc 


ctgggcctag 


gccaaactaa 


ctttgggagg 


tatctccttt 


ccaaaacaag 


accccttcct 


aaccaattag 


ccacacaatt 


agaaattatg 


agttctcacc 


ctccccagtc 


gctcctgggg 


gtgcttcgga 


tattttgcag 


accctgcaat 


taaactggct 


catctgggtc 


ctgtggcccc 


agctttgatt 


ccctgtgatt 


ttatcttcga 


ccccttactc 


accaaattat 


ccttaaaaaa 



25 

ttgagtaata ataaaagtct agagggtctc 
ctctctatgg caattcccct gtcttgataa 
30 aacccactga gcggttacaa tatcgtaggt 

gacttcttga aaaagcctgc atttctaact 
tcaaattgat atactaatct aggtattttt 

35 

taaacttcaa gttattctaa tattaaaagt 
ctcagtcacc tttccccacc atttagaatg 
40 accttaaaaa tctcctggtt ttattgattt 
acagaaagcc atagtgactc acccagattt 
gaccaacatt gtttcttata tgctgtgttg 

45 

ccatttattg tcaatactca gcagaactat 
aatggaatat gtatcagcat tgcaaacatg 
50 tttcatcagc ttggattctc aaggtctcat 

gtcttatggt gatgctggac atcagtgtaa 
attatcgggg aagccaaagg tttacctagg 

55 

ctcagtttgt ttttgtgaag ccttcgtgat 
ccggattcgg tttctagctt gtacaattaa 
60 ttttgtcagc agagtattat ttgtattggg 



atggattatc caacttgtta ccatacagct 3600 

aactgagcac aaaaacctga gaatctgaaa 3660 

aattctggct gagagcatta ttacagtgaa 3720 

ttctaacctc caagccatcc ttgttcattc 3780 

aacttaattt acagtttgaa acaaagacaa 3840 

gcctggggtc tagactgcct ttgtaggagt 3 900 

gtttaggagt catatagctg gaggctgcga 3 960 

ataacatcac tattgtaaaa cctaagatca 4020 

tgatggatca gctggtacca cccacattaa 4080 

cactcagaaa ctgactaagt gcaagagggc 414 0 

cccaagcaat cagcactccc aactcactgg 4200 

cctaatccct gaatctttgg ggagactgat 4260 

ccatagagtc agctgtgcat gaattaaact 4320 

actggctctg tctaggaact gggcagagag 4380 

cttgttaatt agctctagac agtttcccta 4440 

ggtgaagctt tggcattaag gaatgcaatc 4500 

agaaataatg taattgtaca aatgagcaaa 4560 

gaggttaaaa ttgttttcac taataattac 4620 

caccatttgt ttaatgctat agctgggatt 4680 

ctagtattta atgggtgaag aaattaatct 4740 

atgattcaga cttagagttt ataagtccaa 4800 

cctacagggg tccatagtac ctgttctcta 4860 

taatacctca gggaagcaag attagtaaat 4920 

agctatttca tgaaggccag ggtgagagca 4 980 

aaatgcttta atttgtaact gcatcaacat 5040 

cacagacata gacaattacc ttcaaatcca 5100 

aaatattacc tttgccagga ctaggaacgt 5160 

tggcacaata tgagtttaaa gtatggagaa 5220 

cagctgtgta aacaatgact cactcaagaa 5280 

ttttgtttgt ggggattgtt ttgtttgcat 5340 
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atatatttgt gtattttatg tggttttgag 
tctacccagg tctctagtag tcactttagc 
5 attggcttcc tagtttttca ctacacatac 

tgccatgtgt acatacttcc agaactgcca 
gagaatccac ataatgtcac aggaatgact 

10 

aagaagctct tctatgctgt ggatagtgag 
ctataattta ataagatttt tttttctgag 
15 tgcatttttg tctaccttgg gagggaaagt 

gcagctttgt ttgatttcta gatatttcta 
ccagaaactt ctatgggaaa aaagggaaaa 

20 

cactgaggaa gtaaagcgtg atctggagaa 
aggccagttt taagaaaccc ttttatacta 
25 ttcaattttt ttaaaaaaag aaggagaact 
gcaaattatt ggaacaaatt gttttgttaa 
acaaatggga ttatcttatg gacaaatgtg 

30 

aatgatttct gaggatcagg aaagtggaat 
gcaatagtgc agatcagctg ctgttgggag 
35 gatcagtggt agtgcaaagg agggcatagg 

aacctataaa atggccagga tgtcatattg 
agagaggatt agcaaggcac ctgttgtggt 

40 

agtttaaaag acagttatct gacctctctc 
tgcctgtaaa tatgtctatg tttgaggaat 
45 aatttattgg agcttccaag agtctttaca 

gccgggcacg gtggctcatg cctgtaatcc 
acgaggtcaa gagatcgaga ccattctggc 

50 

atacaaaaaa ttagctgagc gtagtggcat 
gcaggagaat cacttgaact cgggaggcag 
55 taccactgca ctccagcctg gcgacagagt 

aaaaaaaaag aacattgcag gggccaaggg 
gctgaaaagt caacttgtaa aagtcagatt 

60 

acatagacat gagggccttc agaatgaaga 
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gggcatgggg 


gttctgctca 


tctctgtgtg 


5400 


ccagatagat 


tctttagcca 


ttaactaaat 


5460 


aataacgtac 


ggtaatcaga 


tgtgtgactt 


5520 


gttcaaaata 


aacagaccct 


ccttgacatg 


5580 


cacatacctt 


ccaggaagat 


atgtgagtaa 


5640 


cagagctata 


aagaattaag 


gctatagaac 


5700 


ttttccctgg 


tggaataaac 


aaagggaaaa 


5760 


aacatgattg 


tgaagtctct 


tagatttaat 


5820 


agagtttgtt 


aacatttgta 


ataataggca 


5880 


cgtgaacaaa 


aaaaggaact 


atggcatttt 


5940 


ataaaaaaca 


gacaaacaca 


tattggtccc 


6000 


agcccttttt 


aacagaaagt 


gttaatagta 


6060 


cataaaattc 


acttagctct 


cctaattacg 


6120 


attcaatcag 


tcttaggtct 


tagagacgct 


6180 


acagaaatgg 


gatgggaact 


tgaagaatat 


6240 


taagtccttc 


tgatttcgag 


aaattactgg 


6300 


atagaaatca 


ttgcaatttt 


aataggctaa 


6360 


atgaagcaaa 


ctttatagat 


ataaaatgca 


6420 


gagataaaga 


cacctttggg 


agactgtact 


6480 


tacacacaat 


cccagtgtct 


tctaaacccc 


6540 


caagtttatt 


acagactaaa 


gaatcacagt 


6600 


tgaatgacta 


tcactaagag 


atgggattcg 


6660 


agaatggcaa 


gaatgttgca 


ggggcccagg 


6720 


agcactttgg 


gaggctgagg 


cggacgtatc 


6780 


caacatggtg 


aaaccccgtc 


tctaccaaaa 


6840 


gcacctgtag 


tcccagttca 


ggaggctgga 


6900 


aggttgcagt 


gagtcgagga 


gccaagatca 


6960 


gagactccgt 


ctcaaaaaaa 


aaaaaaaaaa 


7020 


tatagcttcc 


cttttgccct 


ctgaagtttc 


7080 


aattgagaaa 


aggcatacaa 


attattcaac 


7140 


cccaaaaaat 


ataggggaaa 


gtatccattt 


7200 
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gtatgcttag gttcaacgaa gtaaggacag ccctgtagaa atatgactgg atataatcta 7260 

tgctaatagg cttagttaga accgagcaag acccgtctgt ctaggttttt cttggcctct 7320 

5 

ctgtgctgca ttccttcatt ttggttgtag gacaggacca tctctggaat gggcatctta 7380 

tgacctacag tcaaacaaga tagctcagat gatttcttta tgaccagttt tacacagata 7440 

10 gagtggaagg agagttagag taacactttt aggttttaag gctggctgtg gcaagagggg 7500 

tgtctggttt ctatgaccca ccttggggag ggattcttgt ttgtacagct agcctcaaag 7560 

gaagatagga ctgagagaca ggaaaggaag tcagagaaaa acttttgctt ctgaggtatt 7620 

15 

gttttggggt attgttttct gagcccctaa atacataata ggtatctttg tctttgccct 7680 

ttttattgtt cttttcctcc ttttttctct tatgtgccac tgtcaccatt ctcaggtggg 7740 

20 ctttcgagaa tgaatgtgca aaattcatga ttagggtcaa ccacatctcc atagctatgg 7800 

gccttacaag cattaactgt ctatcatttg catcttcaga ctgatattca gaacagtttc 7860 

aggattctca cactagccta gagaacaatt ttttaaataa ttccatcaac atattgatat 7920 

25 

999&gagggg cagggaagtg ctaagtagag aagggcaggg tccctggcga ggttccaccc 7980 

tcggcctttg cccacaaacc caggtgagga caggcactcc tgttttcatg ccaaatgttg 8040 

30 catttccaag ccagccatgc ccctcctcct gtgcctataa aaaccctgat accctagtgg 8100 

gtatacacac acaagtggct ggatgtcaag aggaacacac tggcagaaga acacaccaac 8160 

agttgctggc agaccatcga tggcagaaca aagcagatac caaaggaaat tcagctgagg 8220 

35 

gtggttggag gagagccagg ctgctgagca gccctactcc agagaagacc accttcccac 8280 

tccatccccg ttctggctcc ctgtccatct gttgataact actatcacca ctcaataaaa 8340 

40 ccttgcactc actctccaag cccacatatg atctgatttt tctgggacac ttgggcaaga 8400 

accctgggat acagaaagcc ctctgtcctt gcgataaggt agagggtcta actgagctca 8460 

ttaaagcaag gtgcctgcgg acggctaagc tgaaagagca cattgtaaca tacacccatt 8520 

45 " " 

tgggcttcgg gagctgtaaa cactcaaccc tagatgctgc catggagtcg gagcccacgc 8580 

tccccatgac ctgcccgtct gcatgcttcc cttaggggtt tgagctgcgg ggcaccaaag 8640 

50 aagtgagcca cacccccatc acgcaccctg caagggggat aagggaaaac tcccctcgtt 8700 

tcaatatgtt aaagcttatt caccccatga gagccatttg gttggtaggc tattaattat 8760 

tgcctcaatt tcagagcctg ttattggtct attcagggat tcaacttctt cctggtttag 8820 

55 

tcttgggagg gtgcatgtgt ccaggaattt atccatttct tctagatttt ctagtttatt 8880 

tgtgtagagg tgtttatagt attttctgat ggtagtttgt atttctgtgg gattggtggt 8940 

60 gatattccct ttatcatttt ttattgcttc tatttgattc ttctctcttt tcttctttat 9000 
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tagtcttgct agcagtctat caattttgtt 
attaattttt tgaagagttt tttgtgtctc 
5 tatttcttgc cttctgctag cttttgaatg 

ttgtgatgtt agggtgtcaa ttttagatct 
tataaatttc cctctacaca ctgctttaaa 

10 

tttgttctcg ttggtttcaa agaacatctt 
gtagtcattc aggagcaggt tgttcagttt 
15 cttaatcctg agtaaaatgc tattcttaac 

tgttactagg ct tat tact a ggactatgta 
tctaggagaa tgaaaacacc atagaggaag 

20 

atccaagcac ctagaacaca gcctggcaca 
gaaaagtgag gaatattttt catgtggaat 
25 ■ ttcagtaagg agattttctg tgaagtacta 
agcagaggca gagaccaaga gaggcagaaa 
agagactccc ccatgcttag tcatcccatc 

30 

tacaaagcta gagatgctca gcctcatccc 
ttcatcatca ctgaatggtg agtcccagca 
35 catttgtttg tatttcactt ggaaggtagg 
caagtccaga tgaccagtaa agcaagttcc 
agtcccagcc aggcacagtg gctcacgcct 

40 

agcagatcac gaggtcagga gtcgagacca 
actaaaaata taaaaaatta aacaggcgtt 
45 ggaggctgag gcaggaggat ggcgtgaccc 
cccgccactg caccccagcc tgagtaacag 
aaaacgatca cagagcacag tccctgaatt 

50 

ttactttcta gggtgtgtga cagggtaagt 
gccaaaagac agaaaccttc tgttatttta 
55 cctgtgtatt aaagaactgc aaaggcataa 

actgcatgaa tcagccacag cccagtcact 
gtttccccac ctgtaaaatg ctgaagttgg 

60 

ggcgtgatac gagattctaa gaaacaggga 
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gatcttttca 


gaaaaccagc 


tcctggattc 


9060 


tatctccttc 


agttctgctc 


tgatcttagt 


9120 


tgtttgctct 


tgcttctcta 


gttcttttaa 


9180 


ttcctgcttt 


ctcttgtggg 


tatttagtgc 


9240 


tgtgtcccag 


agattctggt 


atgttgtgtc 


9300 


tatttctgcc 


ttcatttctt 


tatgtacgca 


9360 


ccatgtagtt 


gagtggtttt 


gagtcagttt 


9420 


atcagttctg 


ttccattggt 


ctattcacct 


9480 


gtatgcttcc 


tcactgttct 


atttgaacat 


9540 


gtggtttttt 


ctgttctgtc 


cattgatgca 


9600 


gaagaagcac 


tcaaactgtg 


gtatttcaat 


9660 


tttttaaagg 


aatgttgtgc 


cctacacctt 


9720 


cgcgaggcct 


gctgtgtgca 


tggcgagatc 


9780 


cctggcagag 


ctgctgagga 


tggtgtagaa 


9840 


atgcccattg 


tcatgtgtag 


cctccagtta 


9900 


catttgagat 


aattgcattc 


ttttgcccta 


9960 


attctgatat 


caagagcacc 


aggaaataac 


10020 


gtctcacacc 


aggtttgggg 


atttactgtt 


10080 


tggaagtaat 


aatatagcag 


ttaagagcac 


10140 


gtaatcccag 


cactttggga 


ggctgaggcg 


10200 


tcctggctaa 


cacagtgaaa 


ccccgtctct 


10260 


gtggcgggcg 


cctgtagtcc 


cggctacttg 


10320 


cgggaggcgg 


agcttgcagt 


gagcagagat 


10380 


agcaagactc 


cgtcaaaaaa 


aaaaaaaaaa 


10440 


aagattacct 


ggattcaaat 


ctagacttca 


10500 


tacttaacct 


ctctgtaatt 


taggttatca 


10560 


gcagagaata 


cttgctagaa 


agagtcatta 


10620 


agaaaacact 


gaaataatac 


agacatagaa 


10680 


actaactccc 


taagcaatcc 


ctggccttta 


10740 


ggggaaaaat 


cgtttcactt 


tcattccggt 


10800 


gaaaattccg 


tgagtggagc 


caataaccaa 


10860 



-223- 



WO 02/062946 



PCT/US01/48437 





gtcaaacatc 


aggggggtct 


atgaacactg 


ctccaaggaa 


tctctctctg 


gataaccaga 


10920 


5 


ccttgtgccc 


tagccccagc 


ccgtcatctc 


cactgctaga 


gaaaaaaaat 


acagttagtc 


10980 


tgtccaaggc 


tgttagagtc 


ctccaacata 


taagacgcat 


ttccatttta 


ctagcttcat 


11040 




ggtctgcttg 


gggtggggat 


ggcggggaga 


tcgtatcctt 


gaagtaggca 


ctaggactgt 


11100 


10 


gagtcacttg 


cctgtctaga 


gcaccaggac 


tgtttgcaaa 


gggctgagga 


gatttccatc 


11160 




aatggaaaag 


agcatgaatt 


tagagtaaac 


atagattaca 


cttagagatc 


agagatcaat 


11220 


15 


ctagagataa 


aatacacttt 


tctctaaagg 


gggtgggaga 


agatgttgga 


caagaaatac 


11280 


cagattttct 


taatggttca 


tttatgtcag 


gctctatggt 


ggggactttt 


atgttatttg 


11340 




acttaatcct 


cacaacaatt 


ctgaaaaatc 


ttttttcaaa 


ttttatgaga 


aagtgaactg 


11400 


20 


gatatttaac 


tgattttctc 


acgaattcct 


tctgaggctc 


agaagtgtga 


aagaagccca 


11460 




aatgaccaag 


gttctttaac 


aatttggtca 


aaacaggtca 


tagacccaga 


ggaaggtctg 


11520 


25 


gatcctggaa 


gttttcttaa 


gcaaagacag 


tgcttgttgt 


ggatatgaag 


atgttgacta 


11580 


agctgggcac 


agtggctcac 


acctgtaatc 


ccagctttgg 


gaggccaagg 


caggtggatc 


11640 




acctgagatc 


aggagttcna 


gaccagcctg 


accaacatgg 


gtgaatcctg 


tctctagtna 


11700 


30 


aaatacaaaa 


aatagacagg 


catggtggtg 


ggcacccata 


gtcccagcta 


ctggggaggc 


11760 




tgaggcagga 


gaatcacttg 


aacctgggag 


gtggaggtta 


cagtgagctg 


agactgcacc 


11820 


35 


attgcactcc 


agcctgagca 


acaagagcaa 


aaaactccat 


ctcaaaaaaa 


aaaaaagatg 


11880 


ttgaccaaat 


ggcagactct 


tgccatgaag 


caagcccaga 


cctcagtgcc 


aagagatgtg 


11940 




aattagattc 


ctgcctcttc 


cacagactag 


acatgtgact 


gggggcagat 


tccttcagca 


12000 


40 


atatcacctc 


atttggagaa 


gaactggact 


aaaataccct 


ccaagtagtt 


tatgagttat 


12060 




ttaaatctcc 


caggcttagg 


tttgctcctc 


tcttagggga 


tagccttgcc 


acccttcagg 


12120 


45 


aaatggagac 


tgtatatgga 


gcatgttgtg 


ccgccagctg 


ctgcatgtcc 


cagagccatc 


12180 


ttaaacccta 


cccttctgaa 


caatggctag 


tctacaagaa 


gccttaaaag 


ggatgtgaaa 


12240 




gtggcaacag 


tgtcccctca 


acaacaaata 


cttccccaaa 


tatgagcttg 


tgtggagcct 


12300 


50 


gatgactctg 


cctctgaggc 


caatccactg 


gcatctttcc 


caagatgtgc 


ctggacttcc 


12360 




ctgagcatga 


ggccaaagat 


ggagaatgga 


aacacccaaa 


agaagtaaca 


aggagacacg 


12420 


55 


gaattttcca 


caggttgctt 


tcagcttacc 


aagagcctgg 


aaagtgagag 


aatagattct 


12480 


ccaaaattca 


aaactaacat 


tactattatc 


aatatttatt 


gtctactttt 


tgtaaactct 


12540 




ttgcatcttt 


tttctttaat 


tctcccaaca 


cccacagagt 


ggaattatta 


ttatccatcc 


12600 


60 


accatttatt 


taatagacaa 


gaaaactcag 


gctaagcaaa 


actagatgac 


ttaaataatt 


12660 
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ctatgtagga gtagggaaaa tgaattttaa 
aaggtctttt tctcatgtgt cacttctatc 
5 gaaagaggaa gcagaggcac tttttatata 

atcctgtgta atcataaata ctgctctaag 
gcagagcacc aagattgttc tggcaaggaa 

10 

cccagaggtg agctgatccc cagccacagc 
caccttggtt tagatcatga cttttgggga 
15 gaattggatt atcatcctgt ctctgattct 

agtccccttt ttgggtctca gtttactcac 
tcttaggttt ccctcattga ggtgattttc 

20 

ctcagtctta cagagtctac tctcaacaaa 
gaaccccatg gtcctactct aattctgtaa 
25 ctccatgtca tgatttatct tcaaagcacg 
ccctccagtc tatctcctca aggcgaatgt 
tgtttggaga tgatagcatc agtctggaga 

30 

ataggattcc ttctaaaagt agcaacgtgt 
taataattcc acttccatac atcactgtga 
35 tgcaaagatg ttcattattg cattgttcat 
aaatagtcat gaacatgtga ttgattaaat 
accatttggc tttttggttg tagtaaaata 

40 

gtgttttttt gttgtttttt gtttgtttgt 
gctcagtcac ccaggctaga gtgcgtgatc 
45 tcaagcgatt ctcccatctc agcctcctga 

gcctggctaa tttttgtatt tttttagtag 
gtcttgaatt cctgacctca ggtgatctgc 

50 

caggcatgag ccgctgcacc cagccttaac 
agcacattca caccattgtg caaccatcac 
55 cccaaactgg aactccgtgc ccattaaaca 

cctggcaacc accattctac tttcagtgaa 
gaatcataca atatttgtcc ttttgtgact 

60 

ggttcatcca tgttgtgcca tgtatcagaa 
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ctgggaccgt 


acaaatcact 


agtctgcaca 


12720 


ttgtgtcatt 


ttgccagacc 


tcagaattta 


12780 


tcaactgtct 


gtaactacag 


caaaattcac 


12840 


aaagggacag 


gaagtctcag 


aggctggaga 


12900 


cagccagtgg 


gaggttccag 


ctgagcgctc 


12960 


acacaggacc 


aggctgcgag 


aacaggtaga 


13020 


agattgtgtg 


cttttgaaaa 


gacagccagg 


13080 


gaccacctgt 


gtgagttgga 


gcgtcagctc 


13140 


ttgtcaaata 


agaaaattgg 


gtaaaaccat 


13200 


ctcaatgttc 


tgcattctcc 


tggtgtgtca 


13260 


tactgtgcaa 


ttagactgaa 


tcttgcatta 


13320 


tacagaaaat 


tgctgggcag 


agctgttgac 


13380 


gattttagag 


ctgtcttgaa 


agctgtgggt 


13440 


ggaaggacaa 


agcatcctct 


cagactgtgg 


13500 


cttttgtgaa 


acataaagaa 


tcttggttat 


13560 


atctaaattc 


aagtgtcata 


gatttaaatg 


13620 


gataattaaa 


tgcaggaggt 


atcacaaaag 


13680 


agtggcaaaa 


agaaaaaata 


gaaaccacct 


13740 


gaatctagag 


acctccattt 


atttaaatat 


13800 


tgcttaaaat 


aattgtagcc 


ttttaaccat 


13860 


ttgtttgttt 


tttaattgag 


aaagagtttt 


13920 


tcggcttact 


gcaaccactg 


tcttccaggt 


13980 


gtagctggga 


ttacaggcac 


tcaccatcat 


14040 


agacggggtt 


tcaccatgtt 


ggccacgctg 


14100 


ccgcctcggc 


ctcccaaagt 


gctaggatta 


14160 


catttttagg 


tgtacagttc 


agtggcatta 


14220 


caccacccaa 


ctccagaact 


ttcttgtctt 


14280 


atagctttcc 


attcccctct 


cccgccagcc 


14340 


tttgactctt 


ctaggtgcct 


cacataagta 


14400 


ggattatttc 


acttgccata 


ttgtcttcaa 


14460 


tctatcattt 


gcttttaaat 


ttaggctttt 


14520 
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tattatttct attcatcggt caaagatgtc 
gttataaaat ggcatggata gaatgactat 

5 

atataaagag gagagagata aagtgacaga 
taggaaacaa gagtgatcgg gtggcagttg 
10 caaactgtat tcccttgtaa tggtggtact 

taagaaataa caaaggaaaa taaatgttca 
tcaattaaac tttcgagtcc tagcttcagg 

15 

ggtatgagat tcgaaaaata aagaaaggaa 
tagtaggaag gagtctagtt ggtatatttg 
20 gccaagtttc tacaacatga aagcaagttg 

aagatcagca atgtccctgg gagaggatat 
caatagggga ggaagaatag agaggatgag 

25 

acctacaggg atatagaaga gctctggtgt 
ctacctgcta gataagcaat cttgcacaag 
30 tatcttcaaa gttcagcacc catctctgag 

tcttaataga atgccacatt catagtacgt 
ttatcatgaa ctactgtttg attctggaac 

35 

tccttctctg gctttaaaga tgtgctggaa 
gcttcaacat tattcccctc ataggacatg 
40 aaaattccaa gaaatgaaaa ctatctcctt 

ttcagataaa gtatctcatg cccggagagc 
gagtagaaat aaaatggagg tccaaacatg 

45 

tcagagagtt acaacaaaag acactcacgt 
aggatagtgt cattatcctt attttacaga 
50 atcctgccta cggccagctc atcaatatca 

actgatatgt tttgggagtt tatcacatgc 
catcttatcc aacctgtacc acaaacctat 

55 

gtttagcagt aactggtgga tatgggatct 
tcagatgtgc aactagatgc ctaggcattg 
60 cctcactagg aagagattct aggtgtcatt 
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tacagaataa 


tgttgagtac 


aaaaagaaca 


14 58 0 


ttccgtagaa 


t tgtgaatgt 


ctgtgaacat 


"1 A C A A 

14 64 U 


tgggaagaga 


gaatgtaaag 


aat atgtgca 


14 70 0 


aaagttaagg 


agctaatttg 


1___.1_.L- _ A_ .1— 1 l 

ttttcttctt 


14760 


ttatgcactc 


tttcaaacag 


aacaaacagt 


14 82 0 


tggtatcaga 


taggctatag 


cctgaaagta 


14 88 0 


c tgagaatca 


agcattttct 


gcagtcctta 


14 94 0 


cccagaactc 


aggaggctgg 


tgacgaagac 


15000 


ggctattgag 


ccacacagag 


caaggtaaaa 


15 06 0 


ggaaaaagaa 


ate ttcaacc 


tatattccct 


1512 0 


ggaactgatg 


agtggcagcc 


aatgcttcaa 


1518 0 


aatcagttgt 


agetagcegg 


gcaagaacaa 


1524 0 


tagatggcct 


gtgcccaaat 


cttgcctcca 


15300 


ctatttcacc 


tctccaagcc 


l__._4_4_4_4_4__.4_ 

tcatttttct 


n c *i _" n 

15360 


actgcagatt 


aaatgagact 


gtcacaaagc 


1542 0 


gcccaatcaa 


tgttagcaat 


__l_4__4___.i_.L_ 

gat tatcatt 


15480 


ctagaatttc 


tetaagectg 


acactttttc 


15540 


ggctcaccct 


gactccactg 


cagggattca 


15 60 0 


taacctcacc 


cacacttcta 


cccagatttc 


15 66 0 


aaacaaagag 


aac tttaggt 


ttaataattc 


15 72 0 


atgagaaact 


agactaaggc 


acagatctag 


1578 0 


gatcgactat 


tatggaacac 


ctgataaaac 


1 C O >l A 

lbo4U 


gtgtcattca 


aat cctcaca 


agagtcct tc 


15 90 0 


taaggaaact 


ggggatgaat 


gagggaaagt 


15 96 0 


gaactgggct 


tccaatgaat 


attagataaa 


16 02 0 


ctccccctat 


gctgagtcca 


1_ 1_ L_ _ 1_ _ J_ _, l_ 

ttatatgtgt 


16 080 


gaagtggtga 


aaataccttg 


cccagggtga 


16140 


gaacacacga 


ctggactcca 


agacatatgg 


16200 


gactcccatc 


ccagtgcact 


tttccatctc 


16260 


tccaattttt 


acatgttctt 


cactccaccc 


16320 
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aagagaggcc agaaccattc ctaaagcacc 

agctgtggca cctttcaagg atgacaagcc 

5 aatgttgagc tgaaaacaaa tctgggcaac 

cacagaggcc cagcaatgct gctgagttag 

caagggaact ggaaatgtta ggaaggaaga 

10 ' 

acatggcata gagaaaaaaa tatcacaagc 
tgtaaacaca ggctgttaat atttgctacc 
1 5 aagtacttta tgtgactatt tcattcaata 

tctaatgtat aaaattgatt taataacatg 
ctgcaatggt ctaatgcttg gaattcagca 

20 

acgaagccaa tataagcaag gaggaagatt 
aaggaaagtg gaacagcatt catcctcaac 
25 tagaagactg atcagtgcag gtgtagcata 

aaggtagcac aagtgaagca gagatacagg 
cactggaggg agagggagat ggacacattc 

30 

ctcacctctc aattgtggga cccctcaaaa 
cttgaccaca atggatgaac tgtgcccagc 
35 catgttttgt ttctgaagga caaccagcct 

ctacaaagta tggtccctgg gaagtttatg 
gttttgatgc ctcttccagc atcatcagca 

40 

tttctcacag ctttacacca aagggcatca 
tgtaccaaaa cgaagattac ctgcagaacg 

45 taagtccacc tcattataag gggaatactg 

cagatctaca gtgggtctgg gaaactcttt 
aggacatgaa atggagcttt aagggaagac 
taaaaccact agatcttttt cactagcatg 
tttatttatt tatatattta tttattttga 

55 agtgcagtgg cacgatctca gctcactgca 

ctgcctcagc ctcccgagta gctgggacta 
ttgtattctt agtagagatg gggtttcacc 

60 

ccttgtgatc tgcccgcctt ggcctcccaa 
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aacacattga ggcaaaatgt caggcagcag 16380 

tttaactaaa ctctggcctt cagtagggag 16440 

atttcaggtg ctgagccttg ttgctgacac 16500 

tggtggcaga caacttgctc ctggaatcta 16560 

aaggaaggaa gaggaagagg aggagaatta 16 620 

gtctgagcca gacaagtctc aatccagtca 16680 

atatattgaa cacattctaa gtgccaggct 16740 

aaagtcactt gagcttatta gccccaggtt 16800 

taagttacag ggttgttata aggattagag 16860 

ggagttaagt aaatgtttac attactaatt 16920 

ccggatggga caggccatct cgctatagga 16 98 0 

atttttacga agacaaaatg aagactggag 17040 

aaagtgtaat cctggaagat gtggtgtgag 17100 

agatagggaa gggaagctgg aagcagaggt 17160 

agggctacaa agcaagttct atgtgatttg 1722 0 

tgtgtacagt actctcccag tgacatgctt 17280 

atgcccactt tccaatgctc cactgatccc 17340 

tggaataatg gcaaatacct tcttaagttc 174 0 0 

tctgtaagtc aaaccttggg aagtaactga 174 6 0 

tcatgctatt acaatcccaa accatggggg 17520 

ctatccctca aagagagaaa cctggacaca 17580 

ggctgccaac agaaaccacc gttcttgggg 1764 0 

agaaaatact ttgtaaatgt atctagacta 177 0 0 

cactcttttc tggctgactc caggctaatg 1776 0 

tcttaaagct ctgcagcagt ttctagaaac 17820 

cttcacaatt aacaggttag tctttcttat 17880 

gacagaatct ccctctgtca cccagcatgg 17940 

acctccgcct cctgggttca agtgattctt 18000 

tgggtgcacg ccaccacgcc cggctaattt 18060 

atactggcca ggctggtctt gaactcctga 18120 

agtgctggga ttacaggcat gagccactgt 18180 



-227- 



WO 02/062946 



PCT/US01/48437 



gcccagccac aaattagtct ttcttatttt 
accatatgga ttttactcaa tatgattaaa 

5 

gagatgagac gatcaagact aatctcatga 
taagacatcc aagtgtccca gatcatcatc 
10 gttttgaata ccctcttata atacccaggt 

ctgccctgtt tctttacaat ctattcttag 
aaagttagat catgtattct cctggctcaa 

15 

aaatgaaaag caaagttggt aaaacatagc 
ctccagtact tcacaagcct catgtcctac 
20 atggaacttg tctgtacctt tacagacaag 
agtgattttc atttgcttca ttccctcgtg 
caataactgt tgatgatgtg aatggttgac 

25 

atctttcaat tgcacatgca ttaaactatg 
cagctaaact tcagttcttc cacttgcgaa 
30 aaataagatg aattacagca gaagagatga 

tcttcattaa cactcctgtg aaaagtttgc 
ttatctactt ttcaagaaaa taaatttaaa 

35 

aaaatgtgtg tatgatcact ttccgatttg 
ggttagctat acatgacttt ctctagaagc 
40 tctttaatat aacaaattat aatgtatttt 

aataatcact tttatctgtt tgaagtgcag 
tgtccagatc ctgtgttgcc tgttgatttc 

45 

ctacccctcc cacttcaatc cagcaatttc 
aggagctctg tgtgtgagta gaatggggac 
50 accaggtgca tagtcgtgga gtgactcaga 
aggcctcgac tttccttcta tcattgcccc 
gcagagagac ttaatctgtc ttcctgcaga 

55 

gggtatacca gcacagaatt tcttacctgg 
attttccata tccctctttg aggctttgat 
60 taatcaactt ttcaggtaga gagtgagaag 



gtcattagta cagtaaaaat tttctattta 18240 

ttaactttga gcttattttt caaatatttt 18300 

cccatttttc cccgttcata tgagtaacta 18360 

cctcatatcc agaactttaa tctccatgtg 18420 

cctattcact gcctatttag atggatctcc 18480 

tacagcaggt agggtactct tttaaaaatg 18540 

aattatccag tgggattcct tttttgctca 18600 

ctaaaaatcc ctacatagtt tgtaccatgc 18660 

tactttgctc ttaactctat tctgccttgc 18720 

acatagtaca catataagct ctccaatggt 18780 

cctggcacac aacatacctg ttgtggtatt 18840 

ttcttttttt atttctccta tgtttgcatc 18900 

agcaaaagca tgggctttag agcaagctaa 18960 

ataattatta ttaaaatcca atgatgaaca 19020 

cctgctctat ctaatttcaa ctgaaaaatc 19080 

tctctatttc cacctcactg gacaactttt 19140 

aactgtcaag tatctttttc aaaaaaagtt 19200 

ctggtttaat aatcctatca gaaaagaaga 19260 

atatcgattg gctccttgta acatttttcc 19320 

atactcttta tactttaccg tgtgcattca 19380 

tctagatttc ctgtcttctc tcttgtagac 19440 

aagtctgggg gccatcttgg tttttgctcc 19500 

caccactttg atgtctgggt acccattttt 19560 

tctaagaggg gacatgcttt ataaatgcta 19620 

ctccaagagg ccaggaggca aaaggcggca 19680 

acgtcacatt tcccctattc aagaagagca 19740 

gcagggggca agagggaatg gtaatacaat 19800 

cattacagca tacctctaga aaagtggatg 19860 

gaggccattg gattccctcc attgtaagaa 19920 

gccattttta aacaataaaa cgagcaccac 19980 
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tggttgatgc cacaggtttg cagagatttg 
gggacatgct tgctgtgtgt ccagaggtaa 
5 ccttccacct aatggctcat gaaaaagggc 

gggcagactc acacttccca caaagggtta 
gataaaaatc aaatattctg ggcaaatgaa 

10 

cagtccctgt gtgaacacaa actgtgcagt 
gcacaatttg tccttgactt ctcctgtaac 
15 gaagctagcg ttagctcata ctggaccctc 

cacacttaag caacttttca gcttctcaga 
taccactgag attaaatact gatttctggg 

20 

tgctcttccc actttcctcg actcccattc 
agtccatgtg tttctgcacc ttgtcccatc 
25 tgttccaatc tctctgtcca ttcaagtcat 

ctgtaattta gtaacactct tctccttccc 
tcctagtgat ttccactaaa acaaatccaa 

30 

tcaatataga cattcttata tcaggtggga 
agatttctgc agtgttctct ccctaaaagt 
35 gacagtgaca tggtgggcac atttccacaa 
tgggcagttt ggcattactg gatccctctc 
ctttgtaagt ataaggacca tcaaatctca 

40 

aattcttcta gtttaaaaat ccctatttta 



50 



60 



taggaagccc 


aggttttcct 


ggctgtccca 


cttgccttct 


agaccagtgc 


tctttcgtct 


gcttggagga 


ggtttacctc 


agttatttaa 


tctttatagc 


ttgtaacctt 


gaccttgaat 


cagatccttt 


tttctataaa 


acctgcctct 


ttcatagagt 


tgggacttca 


cacaaattct 


tggttttccc 


aggactgtct 


cagttttagc 


actccccagg 


caaagcagaa 


taactggttt 


aaagaagtcc 


atgatatatt 


ttcaaagttc 


tcagtcaaat 


atcaggtggg 


gaggcattca 
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gattaacaag 


ggtaagattc 


atgggaggca 


20040 


ccctgagcac 


tagagaaact 


ctctttgctg 


20100 


ccagtagatc 


catttttcag 


agagtggcca 


20160 


tttcttctcc 


acgtagtatt 


tggtacaaag 


20220 


ctatactaag 


gtaccacttc 


tgtgtggata 


20280 


cacataagga 


ggccctggtc 


ctgccccaga 


20340 


cagtggttgt 


gggccccaca 


tctccttaga 


20400 


tttgcctgtg 


attattcaat 


atccaagcca 


20460 


atacccttcc 


tctgccagct 


ggatctttaa 


20520 


gctttatgat 


aaagacttca 


cgtctgagcc 


20580 


tgactcaagc 


aacatacacc 


aataacttcc 


20640 


agagcctctt 


tcctgagtgc 


ttctgttggc 


20700 


tggtccatct 


caacctcctc 


tctttcaacc 


20760 


aggtacacct 


tttctccagg 


cctcctcagt 


20820 


cccctctgtg 


catctgcatg 


tcattcaaag 


20880 


attaggccta 


tctagcaata 


tagatccaca 


20940 


tttccaacat 


cactttttga 


ttgagtgatt 


21000 


gacaactgaa 


ctgatcattt 


ctctctcttc 


21060 


aattatctct 


ggaaaacaat 


caactaagcc 


21120 


gctggttgga 


attgaaggaa 


cgtcagagtt 


21180 


caatttagag 


agaaaaagct 


tagaacaggc 


21240 


ttggtagaac 


caagagtaga 


gcccaaattt 


21300 


acacaatgtt 


gctcctggat 


tatattccaa 


21360 


aaagtgtaca 


tttttaaatg 


taccatggac 


21420 


aaggtagcca 


cccatcccag 


agagagccta 


21480 


tggggttctt 


tttgatgcca 


aatgtcccca 


21540 


agcccttcag 


cgtcagggtg 


accaattgtt 


21600 


ccagaaagcc 


tatatcctgg 


gaaacccctt 


21660 


accctactct 


gtgaccgtgg 


taacaagttg 


21720 


tactatttaa 


gcaactgaaa 


taaacttctt 


21780 


ttcccctgtc 


agaagagggt 


gttgccgtct 


21840 
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tcatccaggc 


tggaggagac 


tatttccatc 


5 


gagaaaaaaa 


atcagtttaa 


agtaaggtgg 


gggggatgtc 


tgcaatgtca 


gcactgattg 




tgccactgcc 


aacatacccc 


tggcttcctc 


10 


crtapatr'tacj 

v-j I— V* CI O * — . l_ d. 


aatataacct 


t tggtccctc 




aaaagagcgg 


cttcagagaa 


aagacaaaag 


15 


aggtggatca 


ggcctatgag 


aaccacattc 




ttttaggaga 


caggattctc 


ccatagcatg 




aggctctgtg 


cgaggtttcc 


agcacatgct 


20 


t" at" ^ text 1~ aa 


aaaaaa tcrcrt 


tat cctcrccc 




ttcatgtgct 


gagtctttga 


aggaaaagta 


25 


taccatatga 


gcaatggagc 


ccacatgttg 




ttgtcaagtc 


caagcataga 


tttccctttc 




ctgagatcag 


aagattcatc 


ttccacattt 


30 


aaaaaactta 
y aa 33 au "-y 


c tcaagcaca 


cagatggatc 




ataggaggag 


aggatttcag 


tttctaagtc 


35 


tgaactcttc 


aaaaccctct 


tccataatcc 




gagcatatac 


cagcctcatg 


caccatccat 




tatcttcttg 


agtcaggatg 


tggcttttcc 


40 


d C*. tA. Wl^-H uA U4. 


aactactaca 


gagagaagca 




cctccaaggt 


ttgtgggaaa 


agtggctaca 


45 


ggagagagaa 


gcacagggtc 


taagcccatg 




aaatcgaagg 


catatgcttg 


tcatattgaa 




gcttgtcaat 


tgaaagccta 


ttttatgttc 


50 


u L.tyL>c*y i— i— ^y 


rrtcracctt 

V^, V»i 1— \-» ^ C* *w t» I* 


ttctctaaaa 




ttcagtttac 


cagaggggac 


cgaagtccat 


55 


ttctaaggct 


attgaggaag 


tcatgggaga 




tatcttgagt 


gctatagaga 


gtcaatgggc 




catgcatttt 


cttattcaat 


aagcatttac 


60 


acaagagata 


caaagatgaa 


taaaacacag 
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caaagactca 


tgcttgataa 


ttggatgttt 


21900 


gagaaaatga 


gggtgatgga 


gtaggggtga 


21960 


cagcaccaaa 


ctcaaagaca 


ataccactca 


22020 


aacaacttta 


gaattattat 


attgccctaa 


22080 


cctagaaacc 


caatgtatgt 


cttcttactt 


22140 


caaactgaag 


aagttggaat 


acctacctac 


22200 


accacaccat 


ttcccaactc 


ctcttctaca 


22260 


acgattcact 


agcaaagaca 


ctgatgtgcc 


22320 


cctctctgtt 


taagcatcac 


tggaagacaa 


22380 


ttttctactc 


cagctctgta 


gctatatgga 


22440 


gtcccagtgg 


gcagtatctt 


tgcccaaagc 


22500 


ttctttgtcc 


cacttgtgga 


gcccacattg 


22560 


tgtgtattca 


gtgcatttgg 


gaagccactt 


22620 


tataaaagag 


ataactgaag 


cccagagtaa 


22680 


agtgacagat 


gggtcagtga 


cagagctggg 


22740 


cagaactatg 


actccaacca 


atgaatgctg 


22800 


tgggttctag 


actgcagctt 


caatcatcag 


22860 


gtgtcctcat 


ctgtgcctag 


ccaggccctg 


22920 


ctaaatcacc 


tctatgggtg 


tgtggcaggg 


22980 


aaggctttat 


agctgcttcc 


tccctttctc 


23040 


gtctgataat 


atttctgttt 


gtgaatgggt 


23100 


tgcctctgcc 


tcatgccctg 


tcacttagga 


23160 


caaatattga 


ttgtcaattc 


aagatcaatt 


23220 


ccacagacct 


atagataggt 


tttccaactt 


23280 


ctgctctttt 


atgtaaatca 


aagtcatgga 


23340 


attctagaag 


ccctgaaggc 


aacagaatga 


23400 


tgttggcagt 


gacctgagca 


gcactgtctc 


23460 


cagaacccct 


ctccctaagg 


aggaattatc 


23520 


taattatatc 


acatgattaa 


ctaaatagcg 


23580 


tttgtgccca 


cacaaggaaa 


caggaaggaa 


23640 
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acagacccaa ggaagcacca ttgcagtaca 
gattctgtga tgagaaccca atgcagagtt 
5 aaatgcagtg agttctgtta ctgcaggagc 

agccctgagg actgcctctc aacattgtgg 
ttattcggag tactattatc caatatatga 

10 

tttaacagtg agtagtttca gactgaatat 
ttgcatactc tctgcttttc tggcagtctc 
15 acagaaataa acaaaaactt gtggttggaa 

ggatgtacta tagaaagtca aaagaggaaa 
ccactggtct ttcttaatca caaacgttcc 

20 

acctttgttg tactcatcac aggcagaaat 
tcttcatttg aattatattt taatttggga 
25 gcccctaccc cttaaagatg cttgtcttct 
ctgtacggtg tttaagttaa tcttcaaatg 
aaaacttgaa ccttaagtcc aaatacatag 

30 

cagagagtgg tgggaaggga gacaacggat 
gggcagcatt ttcttggcac ctttctccca 
35 caagccttgc ttattcaatc ctataaatat 

ggccttttaa gaagaaagac ttcttcctct 
ccaaggtagt atctctcatt cccttggatc 

40 

gcaagctttt gcactcctgt gggtctgctc 
ctttggttca ggagcttgag gaagtggaag 
45 tagatagtgt catcctggtg taggacagac 

atattggggg tttaatttaa agaaaatgaa 
agacaactgg agtccaaaag aacttcctac 

50 

aagcagactc caaaggccta ctctttgtta 
atgaaatttt aaaaattaaa aaatatatag 
55 cagaaggaaa aacaaatcta agaagaaaga 

ctaaagccaa atcatcaaac aagagctgag 
tattggtatt agaggctagg atcacacagt 

60 

acagggccat aaagctcata tggagtaccc 
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atggacagag 


gactaatagt 


gtcacgtgct 


23700 


tctcttccag 


gacctgagca 


gcttgacctc 


23760 


aggcctcttc 


ctccttgctg 


acagcatggt 


23820 


ctcagaaatg 


gattatctat 


cctcattgcc 


23880 


aatcaaagat 


tgtctcctga 


ccagtgtcag 


23940 


cctgtcatgt 


gaaatctctg 


tgtctcccct 


24000 


tggttggtcg 


gtccgaatgc 


ctggagacag 


24060 


gaggcctgtc 


tgctgactgc 


atggtgtggg 


24120 


tcatgaagga 


cagtagggag 


tctatcccct 


24180 


aaccttaatg 


ggtttaactc 


ttaaagtaca 


24240 


attcttccta 


ctcatcacag 


gcagaaatat 


24300 


ttcttttggg 


ttcttttcca 


ggggttggat 


24360 


gatcatcagt 


gaattgagga 


ctagcagagg 


24420 


caagcctgta 


tgctacatta 


tgttaggcac 


24480 


ttattttgta 


ctttggtcta 


acagggctga 


24540 


ttgacttcta 


gaaaaaatgt 


caacctctct 


24600 


aacttgatga 


tcactacctg 


ttcgtggtga 


24660 


agtatcatgg 


tgttaaaacc 


ataaaccagt 


24720 


atgaatcctc 


ctagtaggag 


cacagggaac 


24780 


catctatctc 


acaaatctca 


gaaaatgtgt 


24840 


atgtgaccca 


gagattgtca 


aaagcaaccc 


24900 


agcacagaga 


cctgtctgga 


ggaaacccag 


24960 


atgtggtaaa 


gcaatggggg 


tgggagaggc 


25020 


tttcattacc 


ccaaactgct 


tcccattctg 


25080 


aatatgagag 


aagttgcaac 


atagcagact 


25140 


attaacacag 


caattacctg 


aacaaaagta 


25200 


ccaaagttgg 


aagaagaaag 


gaagggaggg 


25260 


tgacttaggt 


tccagaagct 


aagaaggatc 


25320 


gccaaaatac 


caaggaagtc 


tgaataggaa 


25380 


tttaacatcc 


atgacaagat 


ttggcattgg 


25440 


tgcaaaaatg 


ttgaaagctg 


atactgaata 


25500 



-231- 



WO 02/062946 PCT/US01/48437 

tttactcaat ttgtgaaaat aatataagga aaattctgcc caacatccca agaagtttca 25560 

agaaaacctt ccgtcagctc actaagcttt gtgcaagtgt gccatctgaa ttcataatta 25620 

5 

catataaaat agggggacct caaataaaag cattaagtta aaaactgtta cgagatatgt 25680 

gaaacttcta aggttctagc agaatcaaat ctaaaactac tatctaagga gatgtccgta 25740 

1 0 aacccaactt ataagatgcc tccctatcaa agaaccctac cactaccacc atctacaaac 2 5 800 

tgaagttatc taataaccaa aatattatga gaaacataac aacagcacaa aaaaatgagt 25860 

agcggcagca aatggggaaa catgaaagtc agaagctcaa gaattggaga taacataata 2592 0 

15 

atttgaacaa ggctattatg aaaatacatt ttaaatcatt aaagatgttt aaaaagtaat 25980 

agaaaccagg ccaggtgcag tggctcacaa ctgtaattcc agcacattgg gaggctgagg 26040 

20 cgggtggatc acccgaggcc aggagtttga gaccagcctg gccaacatgg tgaaatgctg 2610 0 

tctctactaa aaatacaaaa attagtcagg gagggtggta catgcctgta atcctagcta 26160 

cttgggaggc tgaggcaggg gaatcacttg aactcgggag gcagaggttg cagtgagctg 2622 0 

25 

agatcacacc attgcactcc agactgggtg acagaatgag accctgtcta aaaaaaaaaa 26280 

aaaaagtaat agaaaccaaa atggaaaaaa aaaaaaagga cagtaaagga aaaaaaagat 26340 

30 atgtgcattt gagaaaaaaa tgaaatcaca tacttggaaa tgaaaattgt ctttgaaatt 26400 

aaaaactgag cagatcagtt aaacagcaga ttcaatatat tcaaggaaac aaagagtgaa 26460 

atggaattta gatttgatta aatcacacag aacacaacac attatgtgtg tttggaaaag 2652 0 

35 

atggatgaaa gcctaaaagg tatgatggat gcagggaagt ggagaggttc agtacatatc 26580 

tagtaggagt cccagtgaga gattataaaa gagagagtat atgcaaaagt aaataactag 2664 0 

40 aaatttttta gaatttaata aaaacatgag atttcagatt gaaggaatgc attgagacca 26700 

ctgcaggata cacagaatca aatacacacc tagaaatatc atcatgacaa taaaaacacc 26760 

atagtcaaaa agaaaataat aaaagcaacc agagataagt ggcagattac tctcatctat 26820 

45 ~ 

cattaggtgg gataccaaag ttctgagatg atgccgttgc tgcttatttc ccttcatcta 26880 

ttaagaaatc tgaaaaagaa agcagaaaag aggattaagc aattttccct gtctcctttt 26940 

50 ctctgcattg ctggagcctc atcgccctgt ctcctcttag ccaggcacaa gggctagctt 27000 

gtcttgggta ccagccattc atgtcctgat ttgcagggtg tcctagtggt gatgctcatc 27060 

ttcactgtgc tggagctctt attagctgca tacagttctg tcttttggtg gaaacagctc 27120 

55 

tactccaaca accctggggt gagtatgctg acatgtcgcc tgatacctgc tgtgtctcag 27180 

gtccaggcta caataatcca acctcaaaaa gtggcaaaaa gaagaatcaa ttattgttca 27240 

60 tgaggtgcat gtggaaggcc acttttataa ttaaaaaaat gagtttaaca gtgaaacccc 27300 
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atctctacta aaaatatgaa aaactagcca 
tagttgggag gctgaggcag gagaattgct 
5 tgagatcaca ccactgcact gcagcctggg 

aaaagagttt gtataaatgg gctccttctg 
gtgtctactg ggaagcagac cagttagaga 

10 

aaatcacata gcccaacccc tcatgaaaga 
tgaatgacac agactattgg gctgagaatg 
15 acacagctac ctgcctcccg agagagtctc 

gactttacta gaatgtctct gtcctgttta 
tagctcttga tgttataatt ttaaaatatt 

20 

agaaatgttt atagaatttg gatgaattat 
aaataattag cagactggag aattgcatta 
25 tatggcaaag ttggaagaac tagtactaca 
tcttttttaa tctccgaaat caaagtttaa 
ttcattttcc tcgacccagt cacaagatca 

30 

gtcttggata taagtaactc ttggcctcag 
caagtgtgat tagactttcc tgaaatctct 
35 aaaaaaaaag cttttgtttt gtatttgttt 

aataatttcc tcaaagccca agtcaataaa 
aaactttata aactgctttg ggtaaactga 

40 

gcactcatgc tagtgtgaat ttggtaagtc 
tttcacactc atattgctta aatattacat 
45 aattaaagac tatatataat taaactactc 



60 



actgggccct 


ttttcatgag 


ccattaggtg 


taggattacc 


tctcaacaca 


cagcaagcat 


aatccaaatc 


tctgcaatcc 


tgtggcacca 


tggccttctt 


ggctgggcgc 


ggtggctcac 


ggtggacgga 


tcacccgagg 


tcaggagttt 


tgtctctact 


aaaaatacaa 


aaaattagcc 


ctacttggga 


ggctgaggca 


gaagaatcac 


cggagactgc 


accactgcat 


tccagcccgg 
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ggtgcagtgg 


cacacgcctg 


tagtcccagc 


27360 


tgaacccagg 


aagtggaggt 


tgcagtgagc 


27420 


tgacagagtg 


agactcagtc 


tcaaaaaaaa 


27480 


gaggacactc 


tggtcatctg 


ggatcagctg 


27540 


attgcttaat 


atgaagctag 


ttggttttag 


27600 


accacggctt 


ggataaaatt 


acatagaaaa 


27660 


agaacgcgta 


tcccctcact 


tttatcggga 


27720 


cctagccaag 


tttccttgtt 


aatgtgatga 


27780 


tgcttccctg 


tgctcttcct 


tacatcccct 


27840 


ttcatttccc 


ctaaaggcta 


caacaacaaa 


27900 


gatttaacta 


tgatcctatt 


taaaatagta 


27960 


tttttaatta 


atattgctgc 


aattacatat 


28020 


caaaacttgg 


ggaaaaggta 


ttatgagtta 


28080 


aaattctaac 


caattatgta 


ttttctagag 


28140 


tatccaacag 


gtcaaaaaga 


gttcttcacg 


28200 


aggaaggaaa 


agcaactcaa 


cactcatggt 


28260 


gccattttag 


atactgtgaa 


acaaactaaa 


28320 


actatgagtc 


gttatttaat 


ttctcttgaa 


28380 


tgttatcagc 


cagtcttcca 


aaatggtcat 


28440 


gcagaaggtg 


atacacagaa 


gggaaaatgt 


28500 


gcgtgactct 


gcaggctgtt 


tctgtattat 


28560 


attagggatt 


gtaagaaaac 


tttaattaaa 


28620 


tgccctggac 


actctctgag 


aaacagatct 


28680 


gaagacagca 


aagagatctt 


ctcaagtgct 


28740 


gtccctgcct 


cccatgaatc 


ttaccatgta 


28800 


accagtggtg 


ctcattctat 


tgtttaaaaa 


28860 


tcctgtaatc 


ccagcacttt 


ggtaggccga 


28920 


gagaccagcc 


tggccaacaa 


ggtgaaaccc 


28980 


aggcatggtg 


gtgcgcacct 


gtagttccag 


29040 


ttgaacccgg 


gaaggggagg 


ttgcagtgag 


29100 


cctacaagaa 


caaaactcca 


tctcaaaata 


29160 
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aataaaaaaa taaataaaaa taaaaataga ctatatatag tctttgaatt catttatttt 29220 

aaataaatga ataaaataaa ataaaaaaga atattcagtc gttggcacat aaactatgtt 29280 

5 

ctcttctgtc atttcagggt aggagtcaat gaagacatct tagtcttaac aagttacaga 29340 

taatctctga ctggcatgtt ttctgcttct agcttggagc taaatgctgc tcatgctgaa 29400 

10 aagaataatg tctatttctt tgtttgggta ttagctctat tttgatctat agcacagttt 29460 

tgtaattcag atcatcaccc ctgcactttg taccacatgg gcgtattctt tcttcccagt 29520 

cattgctgca attatcttgt ctgttctttg tgtaatatgt tctgtgtgag cctctgcatt 29580 

15 

aaactcgatt tcttgggcaa ttatggaaat tccagtgtgg ctgcagttta actttgcact 29640 

ctctatgcat atgaggtttc ctaaataaat gaggagtagc atagtttaaa atatatatat 29700 

20 cttataactt tctacaacaa agaattattg agtccaaatg tcatcagtgc tcattttgag 29760 

ataccctgct atcgatggtc gctacaaacc aggaaatact caagttatta tgtgtataca 29820 

ttggttttag ttttatgaaa caatttacct tcatgatctc atagttaaaa ttgtaataaa 29880 

25 

tttaggaata taaaggatca atatgggaag caaaatttct aaaggcagtt tctgttgttt 29940 

taattagtat ttgtgtagtt caaaccagga aggatttgac tatcattaga ttttgcttaa 30000 

30 ctttatgaaa gctaaaatat tctctgttat aaaggggcaa ctccatctgg tcctatagca 30060 

tctttactac tgattttttt ttgtttaatt tgaaaatgca aagaattgtt aaatgttctt 30120 

aaatgttctc actacaaaaa aagaaaaaag ataactacgt gaggtgatgg atatgttaat 30180 

35 

tagctggatt gtggtaatca ttttggaatg tatatgtata tcaaaacatg tagtacaccc 30240 

taaatatata taatttttat ttgtcaaata tacctcaata aagatggaaa aaaatcgaaa 30300 

40 ttcaaagtta tctactgatt tttataatac agagaaatat ggttaaacag taagttagaa 30360 

aattcatcaa gtaaaagaaa ctcctttcaa aggagggcag aagcgtggct gagtttaaaa 30420 

aacatagatt ttggagacag gtcaagttga gtttgcccct tccctgtcct gtatgtcttt 30480 

45 

gggtgacata accttgctgt tcctcagttt cagtattgta gaactgttaa aagaataaat 30540 

gttagtgcat gtcaacacct tctacacagt tcccagaaca taaggaatat tccataagtt 30600 

50 ttagttcctt tataactcat gaacatatgt gtaaggactt gtttcgtata taccatctat 30660 

tctttgttta ccatatgttt gtaagcaaat tgacaagaga gtaagtcttc tggatacagt 30720 

acttttcacc aggaagatgg ggggctgagc atgctctaga attctaaaac tctgtgactc 30780 

55 

aactatgatt ctgagattct actactgagt agagtcatca ctaagggctc atctctgagg 30840 

gctccatgtg actctggtgg agaggtagat catgatttgg gcggcaatgt ttgctcactc 30900 

60 tttcccttac tagagttctg ccatagaatc atggagtcaa catcccagga cagaagggca 30960 
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actcacgtca tcactataaa accaaacgaa 
catagctctc tgctggattt tctgaaggga 
5 cagtcaatag gtcttccaga aggggagaaa 

agatatgcag agattatttt cagggaggga 
atgagaaatt agtactgatc tttgaatccc 

10 

ctttttaaaa tatctcattg aggaagtgac 
taagaaacca taagtggaga ggtttgactt 
15 gcttcaaaac ccatattcaa agctccacag 

tttcatcaca aatgcaccca ggggagagct 
gcaaaaatca ttacttggtg tgccacagcc 

20 

ctgccagaga agttgcgccc tgagaaccac 
caagctgata ggcccactgg gctattctca 
25 gtgaaatgca agctcatccc aggttctcca 

gacaactggg atgagcccca attccaaaag 
gggggctgtt aggataataa taggatactg 

30 

tgaatagttt atagtccctg tgctcaagaa 
ctcaaatctg tctgatataa gattaattgg 
35 aataaaagtg aaggatttga acctgaagga 

ttgaactgag tctcttaaca atctgagaat 
atggagggaa aggtgtgcca tggaggcaca 

40 

cttggatact tgtaacctca tgtacccatt 
tgctctaatc attgtgggct ttggaactat 
45 aagacttccc cttgttgtcc tcacaggata 

tgtcgattag aagctttgga gaaattgcta 
agggtaatga attccataaa tatggccctc 

50 

ggcaaaagaa gcaactccat ttggtcactg 
caggtgctta aaatttgttt ccaaagccct 
55 taccaacctg ggagactaga tgcacaaata 

tgagctcaca agtagagaat aacttaaaca 
cagactctga aagcttaaaa agaattttaa 

60 

ccagagacaa ctttatgagt ggcactcatc 
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actgtattga 


ctgcatttcc 


ctacagacct . 


31020 


gagccaagag 


tcttgggggt 


aagtcctctc 


31080 


agaagtttat 


ttattcatgt 


atgtctgcag 


31140 


gttaattcta 


cttttccccc 


ttcaccttac 


31200 


agcagagggg 


gaagcaagaa 


actagtataa 


31260 


ttcagggtgt 


ggtggcacct 


cccttgaagc 


31320 


ccccttcatg 


ctcttaaagg 


aattgggagt 


31380 


tgaaccaaat 


gataaggcat 


gcaaaacgtt 


31440 


gtatgcacag 


cctccatcct 


ttgtacaaca 


31500 


cagagatgtg 


gcctggggac 


aaaatctaga 


31560 


tgctgctatt 


cttttccctg 


cagattccta 


31620 


tccctagttg 


ggtccatgga 


tgaaggggag 


31680 


ctctatgtca 


tcaaactttt 


taaattttta 


31740 


cctatactga 


gtgctggaaa 


atgtgggggt 


31800 


gtgagacaga 


cgaaaaaaca 


aatgtaaagg 


31860 


gcttataatc 


tgggggtagt 


cagacatgta 


31920 


taacaccagg 


gaagatagaa 


aataataaag 


31980 


tccaggcaag 


caaagtggga 


aggaaacaca 


32040 


gattgacaaa 


tgagagagga 


aggcatttgg 


32100 


ccctgaggga 


cgtggggtag 


cccacagggt 


32160 


tctctcctca 


caggctaccc 


agatcctgct 


32220 


atttgcactt 


aattacatcg 


gtttctccca 


32280 


tccattctgg 


ggagcactta 


ttgtgagtac 


32340 


taaagataga 


tttgaactgc 


ttcacgtcct 


32400 


caggagacca 


aaaaaggcag 


ctcccttcac 


32460 


cactaccatc 


atcattgtaa 


taactactct 


32520 


tgtgtgtata 


gtaagacatt 


tgctcctcaa 


32580 


tcctacttat 


tcctttgcaa 


ataaggaaan 


32640 


tatctagcct 


acctagcact 


catggaatca 


32700 


aattattatc 


actcatcctc 


cttcttagcc 


32760 


tagataatca 


tatagccata 


aatattattc 


32820 
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ttaatacagt gctatttcta tccaatcccc 
tatttaagga cagactcatt agagcttctc 

5 

cagtgttact gtctgtttta gttgttatat 
agtgtgcatg aacgaacgaa tgaatcaatc 
1 0 ttaataaatt aggccactat ctctgactat 

aaatactcca tgccttttct ataatgtgca 
tcccaatata aataatgtaa tattcaatat 

15 

aaaatcaaac agaccaaccc ctcactttct 
tttgtgtact ctgcatgtgt gaattaaaat 
20 tacttttcac ttaaaagata tttactgagc 

gctgagactg agaatactgg aaaacagctc 
gatttagaca gtttgtcatc taactgggag 

25 

aaatatttgt tcctacactt gtgctaagtg 
agggtttaat agaggactca cctgaccatg 
30 ctcactaggc gaagtggaag gaaagcatcc 

tgatgtggga aagagtaacc aaaagaagac 
ggctataaag gtagggtgta accaggtcgt 

35 

gaacttcaat gtgaaagcat gggaaagcca 
tgaagacttg ttgaagtatc attgtgtttt 
40 taaacctagg agaatccaat gagaggaggc 

cgaactagct tggtggcaat ggagctagag 
gaagtgacag gacttggcaa tccaatgtgg 

45 " ' ~ 

actttctgat ctctgcaagt cagctgcaga 
aggtccaagg ccagttccca cagagaaggt 
50 gactccacat ggagtgacag agttctcatt 

tgggaaatga gggactctga taactacact 
gggtcttcct tgagaaagtc tcccttcacc 

55 

tcttcaaggc ccagagctca ggaatctctt 
tcctctggat cttagaggag cctgcctgtc 
60 agtgctgagc aaatccagca caaagtttct 
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tccctgatgc tcaggtggaa aaacgggatg gtcttggaga ttcacaaaca gatattttgc 34680 

aaatccaagg gaaattatac tgcaaggtgg tcaaacacat cacctgttaa cttttgtctc 34740 

5 ttgtcttctt aacagtttat tcttacagga tacctcacag taaccgataa gaaatcaaaa 34800 

cttctggtaa gccacttaaa ctacataaaa tttaaaatct tctatttgtt ttatatcatc 34860 

tttatgtatg aagcctttag taataaagtt acctttcagg aaaatctgta tgcatcaaag 34920 

10 

ttgcagttca gatcattcct atttgccagt gccactgttt gcctgacaga cattcttaca 34980 

cttgttccca tttccctggt caagtaaact aaaagagttg tttattggaa accattctgc 35040 

15 atccctgtaa ttcattgttt gtaattaggg gtaattccct gccctgcacc ccccaggacc 35100 

cagaaatatt ggacaacatc tgaagacatt tttggtcatt acaactggtg ttattggcaa 35160 

caagtgagta gaggccaggg acatttctaa gcatctaaaa tgcacaaggc agcctccccc 35220 

20 

caacacaaaa attatccaac aaaaaatgtt tatagtacca aggttgagaa accttgctct 35280 

aaccaaaacc aatcctaagc tctcactaat tcttcctccc aatcctagcc ctatctccaa 35340 

25 taccaaccga ggcttttctt tagatcctgc ccaaggtctg gaagaaagag tttgggcatc 35400 

aatgctggat tgttctagga ttgaaccaag gctatcactc acttctgtgt taacctgaag 35460 

ggaaaagtgc ctacttctaa gggttatttt gagagtttaa tgagacatta ctgataaaag 35520 

30 

gcaagatgat gcttaaatgt tcatactcct ttctaaccac tctgatcttg tttaaggtcc 35580 

tggggatgtg ggcctctctg aaagtcagtt gaactctgtt cttatggtcc tctgccagtt 35640 

35 tcctctaggg atagttacaa ggcatagaga cattctgagg ggagggcaga taaatgtttc 35700 

catccctgct tatggaaata catgcagaca atgcagatga agccccaaat gctgttctcc 35760 

ctgtattgac ttccttccag agctgagaaa taaactatgg gtaaggaaaa cctactcctg 35820 

40 

actcactgag agctatgcag atggaggtgt gcaggaatga tggaggcagg atgctgaacc 35880 

aggcatcaga agacctggct tttcatcaca gcccttccac aaaccgtgaa agtgattgct 35940 

45 cacagtcact tacttcctta attccattta tctgtaaaat gtggataaca acaagtgtcc 36000 

ttcctacagc acaaaagtat aatgagaatc agaagagact aggctgggca tgttctttgt 36060 

aaagcccaat acacatgagg gattgattat ctagagatct tagtctataa aatgtcctat 36120 

50 

gggagccaaa atagagcctt gagaaaacaa ggggactaaa gaccaagaga gagatggcct 36180 

atctatatgt tcctgtgcat tttctgccaa agtgcctatt tgcaataaaa acctgggatt 36240 

55 gaggtaaaca ttaagtcaca agtacacttc tgaggtttgc cacaccaagc cacaaacttc 36300 

cagaaaggac tggctgtgta aaggcatcat tgtattttcc accacggtag tcataacaca 36360 

atcaataaag gtacagcaaa tacctgatga atggaaatgt gaagcttttc tttgtttttc 36420 

60 

ctttaaaaga gtgatatcag tcctgggagt gggtaagaag gaagaaatca gatgaccccc 36480 
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ttcctgtaaa ggaactcgga tatacctacc 
aagaatgaat aaagggaatt gggtggggca 
tcaagaaaaa acacctttaa actccagtcc 
ttctgttgat tattgcttga attctcccag 
10 tgtcacctct tttgttcatt tttgtttctc 

ggtgcacatt aaatatttat gaaataaatg 
gacacattca catttttaaa ggcagaaatt 

15 

aggtcctagt gaaacttgga cttgaaatga 
acacattgta ctgcttccag attatgaaag 
20 taaaatgaac attttcttta gcattagcca 

gtttgtcttc tgactttaat gagttagaat 
cctgatggaa caactcttcc ctgagaagta 

25 

tacaagatat ttttgctttg gtttcgatct 
cagggtcaag gtgtcacggg catgaatgtt 
30 actttcacca ttctcagcta cagacatcaa 

gaaatatgtg ttttcagtag aactcttttc 
tccctaaaat cttctctgaa ctgtgtccat 

35 

taaatgaagt tgtgtcagca tttgaactga 
tacaatggtt aaggagcgat attttctgga 
40 ttggcctaat attgtattct ctctaagctt 

tgagaatgct tacctcatgg gttgttgtga 
cagttaaata gtcagtaggt ttagctaata 

45 

tttactgttt cagcattaat gaacaaaatg 
caaaatacag tgggggaagg tatttagaag 
50 ttaagacata cattatttca cataattgtg 

tcttttacca ggaaatgtag ttgtctgtgc 
gtgacattat ttttaagcat gtcagtgact 

55 

caagtcctgg ggttttattt tattcctacc 
ttgaagaata tttgaatccg ttagaaggga 
60 ctcagtaaat aagaaatgag aagttagttc 
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ccacccaccc 


acacccccct 


ccaaaaaaat 


36540 


gtgtgaaata 


J__ J_ J_ A_ X- _ X_ X- _r-u _M« 

tttttattgc 


aacaatgagt 


3 660 0 


aaataggtta 


acatctttaa 


atgtt aacga 


3 6660 


ggaatgactg 


taaaccacat 


gattgcagga 


"i _r ~7 o r\ 

3 67 z 0 


cagggcctaa 


tgcagtactc 


_ > _ 4— < 4_ a. J— _ i_ 

acatgtatct 


3 678 0 


aataaatgga 


tactacccta 


aaagttaggt 


3684 0 


gagattccag 


gagattaggt 


tacttgccta 


3690 0 


ggtttcattt 


ccaagcaaag 


_ 4— 4— 4— 4— - - _ 4_ _ 

atatttgcta 


3 696 0 


atggtggtga 


taaaatgaga 


taaaatggga 


3 7 020 


tagagcatcc 


ttagcccatt 


ctaaaac ttg 


3 7 0 8 0 


ttagtttatt 


aagccttcca 


atcatactga 


37140 


tctggaaaga 


taaaagcact 


aaaaatttct 


37200 


tatttatttt 


— X— ^ , n A ■ 1-1--- -t— 

atcattttta 


___ _ X_ — . 4— a, X- X-v JL. 

aactatcttt 


37260 


atcagctcct 


tggttgcgat 


aactgggatt 


3732 0 


gacaagtact 


gccagatgcc 


— 4— . *- 4- 4- — > — . 

atcct ttgaa 


373 8 0 


attgtaagtg 


~f._j.f- ~ 4- +- 4- 4- 

gtcttatttt 


gcatgaagaa 


_> /44 U 


gatgtaatga 


ttcactgatt 


4_4_4__-4_4_4_l__4_ 

tttattttat 


3 7500 


cggttgagaa 


gggtaccatg 


gggttactgt 


37560 


cttgaatcct 


gttcacacaa 


gcttgtcacc 


3762 0 


cggtt ttcct 


caagtggaaa 


atgaacataa 


37680 


ggattgctta 


aattaaaaca 


gtaccttgca 


3 774 0 


caattat tat 


ggccatagat 


tgaattaaat 


"5 n o ft A 

37800 


atctaaaagt 


ttgctcaata 


acactgagat 


37860 


tcatttaaaa 


aaataaggat 


ttttataagc 


37920 


gcaattttat 


tcattaaagc 


agatgagaga 


37980 


aatttaaagt 


atttatctaa 


caagtaatca 


3 8 040 


ccaataggcc 


tttctctcta 


gttccctaag 


3 8100 


agctatatct 


tgaagcatgg 


gttctataat 


38160 


aataaagtgg 


aatcacagaa 


aatggtcttc 


38220 


ctgaatacaa 


gaggtaattt 


aatcctcaca 


38280 
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atcactcagt gaaatggt cc cagggactct ctcatgtgtt ctgagataaa ttctaaaatt 3 834 0 

agactcatga tagattacta aagaaagcca gaaaacaaca taagcgataa aaccctgtcc 38400 

taactgggcc aagaacatga tacaacatta taattcttgg ccattgctga atggacagga 38460 

ggcaactact gtttgttatg ttcgtatata cgaggaagaa cttgtatata cccgaaaggg 38520 

ttaaatctcc agatatatga aattccgtat ttcctgctgt ccatggatgg gtattgcaag 38580 

attagttacc tgcatacaga accatgaaca atctcttcct agttctcact tacataactt 38640 

aacaatttct tgctgtaacc caaacatcat aagctaaaag agccagaaca gtgggcttta 38700 

15 gatgtctctt gggctgaccc ctgtcctaaa gaccaatcta ttgtctatac tgtggctaga 38760 

gtaatcattt aaaaatgtaa atctgagcca ttacttaatg gcttcctgta aaacgtatga 38820 

tgattaagac ccaaatctta acagccttac caaaccctgt aacacctatc ctacctatct 38880 

20 

ttccaacatt attccaggac ttcatccttc tcactcctcc acactccagc cacttcctat 38940 

aatcccttaa tgtgtcaagt atccacggcc ccaagatgat cagaggatca gcttcttccg 39000 

25 gctttgaacc gtcaccaggc cagttataca attaactcat cctcatactt aaatctcaac 39060 

tgaaatacca cttctttaag gaagttcttc ctgacctcac ctctatgctc atatctgcat 39120 

agatccccca ttagagtagc gcatagctcc caatccttct tcacagcatt tatcacaatt 39180 

30 

tagttaggta aataattatc taattacttg ttattgtctg cctcccttac catgatgtaa 39240 

gttctacaaa atccaagacc aggtatatct tgttcaatgg agaaagtcag taaattgcat 39300 

35 acaatagggt cttcatacat ttcagtttta ccaacaaata ttgaatattt aagattttgg 39360 

gtgtattgca gggaaaccaa gattcagaaa cacacagtta tttgttcaag gttaagtggg 39420 

actttggacc cacttttttt tttttttgag atggagtttt gctcttgttg cccaggtttg 39480 

40 

gagtgcaatg gcacaatctc ggctcaccac aacctccgcc tcctaggttc aagcaattct 39540 

cctgcctcag cctcccgagt agctgggatt acaggcatgc accaccaggc ccagctaatt 3 9600 

45 ttgtattttt agtagagacg gggtttctcc atgttggtca ggctggtcgc aaactcccaa 39660 

cctcagttaa tcctcctgcc ttggcctccc aaagtgctgg gattacaggc gtgagccacc 39720 

gcgcccggcc aaccttggac tcactgtaac tcaggtcttt ctgtcttcat agtttaaact 39780 

50 

tttgagacac cataaggtac catgttactt ctcaatataa acattacctt aaagcaaaat 39840 

gggcaaaatg gcaatctgtg ggctatatcc agcctgtata tatgttctat ctggctcttt 39900 

55 agggaaaaac aaacaaaata ggcattatgg tatccatgtt cccacatgac agcaattggc 39960 

tggaattgag taatggatgc ctcctatgaa tactattact ctctactctt tccctgttta 40020 

gttcacaatg actgcccagc tcccatggac attggagttt gttatccctg cctcatcaga 40080 

aagacacaag atgaaaattg tgcagtgtct gatggctaca gaaccatcca aaatatgggt 40140 



60 
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ttcccatact 


tggtgataga 


gtgtaagggg 


ttctcagctt 


atatatttaa 


aataacagct 


40200 


5 


ctcatttcaa 


ataccattta 


tttctcctca 


ttatttgcag 


ggaattttgt 


taatcttact 


40260 


gatcatcagc 


atagcagagc 


tcagcatctc 


tgtgactatt 


gcatccttta 


gaagcaagtg 


4 032 0 




ctggacacag 


tcagatgagg 


tgtgttctga 


tgtctcttaa 


tattattaag 


agaacaggtt 


40380 


10 


cctgttcctg 


ttgggatact 


gcacacaatg 


atgtctgatt 


tattagtagg 


aaaagcagca 


40440 




tgctcagatg 


ctgaaataaa 


acatctaaat 


ttaggaaggt 


cagagagggg 


caagcttaag 


40500 


15 


attgaagctc 


accccaccat 


ctgaaacaac 


tatgctttcc 


tttcttgtac 


tatctgcttt 


40560 


atactcagtc 


atctctgtta 


agacatagct 


ccttatgtct 


ctgccaaaaa 


tatgtaatat 


40620 




cagacttaac 


aggaacccaa 


attactgtga 


catggggtgt 


atattcttgg 


atcataagat 


40680 


20 


gtgtgacaca 


gttccaccac 


cacaaaagaa 


ataggaaaaa 


aagctgttac 


tactgatact 


40740 




acatagtatt 


tataataatt 


gttatagatt 


atatttactg 


aattctgact 


atgtcagtca 


40800 


25 


ctgttataat 


cactgagcac 


caacttaatc 


ttaacaaaac 


tattataaca 


tctttatttt 


40860 


acatatgaag 


gaagtaaggc 


acaaagaggc 


taaactaacc 


tctccagtac 


cacataacat 


40920 




caggattctg 


acccctgtga 


tatgtctcca 


gagcctgctc 


actaagtccc 


aaataccagt 


40980 


30 


acaggccttg 


tgatttatgg 


agatttcatt 


tctctctcta 


gctctctctt 


ttttttacat 


41040 




caactttata 


gaagtatcat 


ttataaacaa 


caaaatgtac 


ctatgtttag 


ctaccatcac 


41100 


35 


aatcaaggta 


tagtgcattc 


ctagcatccc 


ggaaggttca 


ctggaacccc 


tttgtagtca 


41160 


atgttctcat 


cacctagatt 


ggttttgact 


ttccaaaatt 


tcatattcat 


ggaatcataa 


41220 




gtaagtactc 


tctgtatatg 


gcttatatgg 


cttcttccac 


acagaaatat 


gtttttgaga 


41280 


40 


gtcatccatg 


ctgttgcctc 


tatcaagaat 


tcattccctt 


tatgtatggg 


tacaccacaa 


41340 




tttgtttatc 


aatttacctt 


tttttttttt 


tttttttttt 


tttggagtcg 


agacctcact 


41400 




ctatgaccca 


ggctggaatg 


cagtggcact 


gtctcggctc 


actgcagcct 


caatctcctg 


41460 


45 
















gactcaagca 


acctttccac 


ctcagcctcc 


caaatagctg 


ggactgcaag 


catgcgccac 


41520 




cacccccagc 


taaatttttt 


ttgtattttt 


tttgtagaaa 


tagggttttg 


ccacgttgcc 


41580 


50 


caggctgctc 


tcaaactcct 


tggctcaagt 


gatacaccca 


ccatgacctt 


ccaaagttct 


41640 




gggattacag 


gtgtgcgcca 


ctgcgcccag 


tccatttacc 


ttttgatggg 


tatttagatt 


41700 


55 


gtttaaagtt 


ctgaactatt 


atcataagtt 


actaagaata 


tttgagttca 


tatatttgtg 


41760 


tggacatatt 


ttcacttctc 


ctggagtgtt 


gcatctagga 


tttgattgcg 


agatcgtatg 


41820 




gtaaccagat 


gtttaacttt 


ataagaaaat 


accaaacagt 


ttgtcaaagt 


ggctgtattg 


41880 


60 


cttttcattc 


tcatcaacaa 


tgtgtgagag 


ggacagttgc 


tccatatcat 


taacacttga 


41940 
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aactgtcagt ttattaataa tttagctatt 
gttttaattt acatttccct aatgactaat 
5 tccatttgtg tatcttcttt tgagatgtct 

tatcttctta ttggccttat agcagcctaa 
ttttttcaca ctgtgacttg ccttttcatt 

10 

ttgatgaagt tccatttatc atttttgctt 
caggttttcc ctacccccaa atcatgaagg 
15 ataattttag cttttgtgtt tagatgtatg 

ggtaaggtta gaagttaatg cacatccccc 
ttgttgaaaa tgcaatgctt tcctccatca 

20 

ttattattta tttgtaggtc tcttttggat 
ttccacgtat tctacactct cttcatttct 
25 gtgaaatata tataacataa tagttatcat 
ttaataatgt tgtgtaactc tcaccactat 
cacaaactct gtaccacaaa acaatcactc 

30 

cctctatcct actttctata ttatgaattt 
tcttacaata tttgtccttc tctaactggc 
35 taatccatgt tgtagcatat atcaaaattt 
tttgtgtata taccatattt tgcttagcca 
ccttttgcat attgtgaata atgctgttac 

40 

ccctgttttc aattcttttg gatacatacc 
tctgtgttta atgttttgag aaaatgccaa 
45 catttccact agcagtgtac aaggattcca 



50 



60 



acaggggaaa 


agggggagta 


attattttcc 


cacataattg 


tacatattta 


tggggtacag 


aatgatcaaa 


tcatggtaat 


taccacattc 


tgacattcca 


aattttctct 


tctagtcacc 


tagtcatgct 


actaagtata 


gaacttattc 


ttcctgtcta 


gaattttata 


cccattgacc 


cttccctagc 


ctctggtaac 


cactattcta 


gattccaaat 


gtgtgtgaga 


tcatgcagtg 
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ttagtgggta 


tgtagtggca 


cctcattgta 


42000 


gatgatgaca 


ctcttgctat 


atgtttatta 


42060 


gttcaaatat 


ttagctgaat 


tttattggat 


42120 


ttaattctta 


atacaaatcc 


tttacaaata 


42180 


ttcttaatag 


cttctttgaa 


aaaaacaaat 


42240 


ttatggttat 


ggtttttgtg 


ttctgtttaa 


42300 


ttttcttcta 


tcttttccta 


cagaagtttt 


42360 


gtctatctta 


actttcatgt 


acagttgtag 


42420 


ctcatggaga 


tatccagttg 


ttaacgccat 


42480 


ccttggtatt 


tttgttgaaa 


atcaatttac 


42540 


actttttccc 


tctggccact 


acgtctatcc 


42600 


gtgtctttag 


agtaattctt 


ttgcattgtg 


42660 


tttagtcatt 


tgttcaaatc 


attgaataca 


42720 


ctatgcccca 


aaatttttca 


tcatccccaa 


42780 


cccgattccc 


ctcctgattg 


cccctggtac 


42840 


gccttttcta 


gagacctcat 


ataagtggaa 


42900 


ttatttgagt 


aagcataata 


ttttcaagat 


42960 


cattccttct 


atgactaaat 


aaaattccat 


43020 


ttaatttgtt 


gatggacaca 


gatatttctg 


43080 


gaacattggt 


gtaaaaatat 


ctgctcaagt 


43140 


taagagcgaa 


attgctggct 


tatatggagt 


43200 


attgtttttc 


accttcgttg 


caccatctta 


43260 


atttctccac 


atctccatca 


ataccaacca 


43320 


atttattttt 


aattttcaat 


ttatgattga 


43380 


tgtgatgctt 


caatgcatgt 


tacattttct 


43440 


atcactttaa 


acatcatttc 


tttgtagtga 


43500 


tggaaatata 


cactacgttg 


taatttgcta 


43560 


ttactatcta 


tagtaagtaa 


gaacttactc 


43620 


aatccttccc 


ggtcctccca 


tcctccctac 


43680 


gtcttcactt 


ccatgaagtc 


aactttttta 


43740 


tttgtctttc 


tcagcctggc 


ttatttcact 


43800 
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taatataatg tcctccaggt taatgcatgt 
tatggctgaa tagtattcta ttgtttgtgt 

5 

aatgtatatt ttatatatat atatattaca 
atttaggttg attccttatc ttggctcttg 
10 agatgtttct ccaactgtct tcatttcctt 

atcatatttg tttttatcat ggccatcttt 
tctttctttc tttttctttc tttctttaat 

15 

cagtggcaca agatcatatg tcacttaggc 
gcctcaaact cctgggctta aatgattctc 
20 aaggcatgtg ccaccacacc cacataaatt 

aggtcttgct atgctgtcag ggctggtctt 
ttggccttcc aaactgctgg gattaaaggc 

25 

gccatcttaa tgggtgtgaa gtggtatctc 
tgatgttgag catccatgta tacatcttct 
30 tttcttaatt gggctgtttg ttttttgttg 

atcttagtcc ctcagtagac atatgatttg 
tttgcacttt cttgatgact tttgatgcac 

35 

ttattttgtt gttttctgtg cctttcttgt 
tgaagatatc ttccaacttt tttccatgaa 
40 cggattgatt ttgagttaat gtttgtatat 

tcacatgtgg atgcccagtt attccagcat 
aatattcttg gtgtctatgt tgaaaattga 

45 

tatttctgaa tgattccatt ggtctatata 
gggttttttt ttttgtttgt tttttgtttt 
50 aggctggagt gcagtggtga gatctcagct 

aattcttctg cctcggcctc ctaagtagct 
ctaatttttg tatttttagt agagacgagg 

55 

ctcctgacct caggtgatcc acccgcctcg 
agccactgtg cccggccctg tattgtttta 
60 ttggaaagta taatcactca aactgtatta 
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tgccacaaat 


gacagaatta 


cattctgttt 


43 860 


gtgtgtgtac 


attatatata 


tgtaatatat 


43 92 0 


ttttctttat 


tcatttatgt 


gtagataggc 


43 980 


tgaatagcac 


tgcagtaaat 


atgggagtac 


44040 


tgaatatata 


ccctgtaatg 


ggatttctgg 


44100 


A_ X— i_ M X- 1— X— X— X- 

tttctttctt 


tctttccttc 


tttccttctt 


44160 


1_ X- X- X- _ x_ _ _ _ 

ttttatacaa 


agactatgtc 


acttaggcta 


44220 


tacagtggca 


caagatcata 


gctcactgca 


44280 


ctgtttcagt 


ctcctgagta 


cctgggatga 


44340 


tttattttta 


tttttat ttt 


tgtagagaca 


44400 


gaactcctgc 


ctttaagcca 


tccttccacc 


444 60 


atgagtcacc 


acacccagcc 


ttatataatg 


44 52 0 


* _ a X- X. X- X— m . „ 

atacttttga 


A_ X— -I— r-.. — X- X- X. X* *a» 

tttgctttta 


cctaatgatt 


44580 


ttggagaaat 


aactgttata 


gtcttgcctg 


44640 


ttgaattgta 


gagttcttta 


tatattctgg 


44700 


caaatatttt 


ctcctattct 


gtaggttgtc 


44760 


aaaagttctt 


tatgaagtat 


aattaatcta 


44820 


cacatccatg 


aaattattgc 


caaattgtca 


44880 


atgtatggtt 


ttagctctta 


ggcttagtgt 


44940 


ggtataaggt 


aagggtccaa 


ctctattctt 


45000 


catatgttga 


aaaaattcct 


ttccccattg 


45060 


tttactatat 


atgtgatata 


agtgagcatt 


45120 


tcttccctta 


tgccaggtac 


catattttt t 


4518 0 


ttgtt ttgat 


ggagtcttgc 


tctgtcaccc 


45240 


cactgcaaac 


tctgcctccc 


gggttcaagc 


453 00 


gggactatag 


gcgcatgcca 


ccatgccagg 


45360 


tttcaccaca 


ttggccaggc 


tcatctcgag 


45420 


gcctcccaaa 


gtgctaggat 


tacaggcatg 


45480 


attagtgtag 


ctttacagta 


cgtttcaaag 


45540 


ttctttttca 


ttattgtttt 


ggctattcag 


45600 
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ggcctcttaa aattcaatat gaacttgagg 
gttggaattt tgatagaaat tgtattgaat 
5 ttaacaattt taaatcttct tatacgtgaa 

ccaattacat tcagcaatga tttatatttt 
agatttattc taaagtatgt aattatttta 

10 

atgttatttt catattgttt attgctgtta 
gctcttgtat cttgaacctt gttgcaatcc 
15 ttgggaggat tttctatata ggaagatcat 

cctcttttct aatttggatg aattttattt 
cttttagtac aatatttaat agcagcagtg 

20 

ttagggggaa agcttttaat ctttcaccac 
aaataaactt tacatattga ggaattttcc 
25 catgaaaggc atttcatcaa atgccttttc 

ttccaacatt atattaatgt aatgtattac 
gtattttgct tatcaatttc acttcatcat 

30 

caatttttgt atctacactt ataaaggata 
aattatctgt ttttggtatc aaggtaatgg 
35 ccttctcttc aagttttcag aagagtttga 
ttggtagaat tcactggtga agccatctga 
gattacgaat ctttttactt gttatggatc 

40 

ttttaggtaa tttgtgtgtt gcaaggaatt 
tggcatataa gtttttatag tattctttta 
45 tcagtaataa tgtcttactt ttatgtcttg 

cagtataggt aaaggtttat caattttgtt 
attgattttc tctattgttt ttcaattctc 

50 

ttatgctaga gtaattcttg aaatcagata 
aaattgtttt ggctgttcta ggtcatatga 
55 tgattgctac aaaaatcttg ctgggatttt 

ttataaacaa tttacttctt agtaatatgt 
ctgtattttc ttacatctca tttcatttct 

60 

aagccttatg catactttgt taaatgtatc 
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atccattttt 


ccatttataa 


gaaaaagact 


45660 


ctgtatattg 


ctttgagcag 


cactgacatc 


45720 


cacaagatgt 


cttttcattt 


atttaggtct 


45780 


ttagcataga 


ggtctttcat 


cccattggtt 


45840 


ggtggcatta 


taaatgaaat 


tgctttttaa 


45900 


tacagaaaca 


acaatgattt 


tttgtgggtt 


45960 


tgtattaggt 


ctagtagctt 


tcctgtggat 


46020 


gtcatctgga 


aatacagatg 


gccttaatct 


46080 


ctttttattt 


tctaatagct 


ctagctagaa 


46140 


aaagaaagca 


ttcttatata 


gttcctgatc 


46200 


tgagtatgat 


attagctgtg 


ggtgttttat 


46260 


atctcttcct 


agttttatga 


gaggttttct 


46320 


tactccaagt 


gagatgactg 


tgtgggggtt 


46380 


atttattgat 


tttcttattg 


taccatcctt 


46440 


atgaataatc 


cttttctaat 


attttgttgg 


46500 


ttaatctgta 


attttctttt 


cttgcagcat 


46560 


tggcctcata 


gaatgagtta 


ggcaattttc 


46620 


gaataattgt 


tttttttttt 


tctttaaatt 


46680 


tccagaagtt 


tcctttgtta 


ggaactttta 


46740 


tgtaaagatt 


ttctatttct 


tcttgaatca 


46800 


tgcccatttc 


ttatacagta 


tctgatttct 


46860 


caatacttta 


tacttttact 


ctctataagg 


46920 


ttttaattgt 


tttattgtct 


ttttgctttt 


46980 


gatcttttaa 


aagaatcaac 


atttgggttc 


47040 


tatggagatt 


tctcttctgc 


cccactaaaa 


47100 


gtataaatct 


tccaactttg 


ttatttttca 


47160 


atttccacat 


aagttttaga 


attcgtttat 


47220 


ggttggaatt 


gtactgagcc 


tatagataat 


47280 


agtcttctga 


tccatgtcca 


tgaaacatgt 


47340 


ctcagcacta 


ttttattatt 


tacaatgtat 


47400 


tgtaattatt 


tcattttttc 


ttgatattgt 


47460 
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acatggtatt tttaatttta ttttttaaat 
tttttatata ctggtctact atgacctttc 

5 

ttttaataga gtttctagaa ttttcttagt 
tttttactcc caaactttat gcctcatttt 
10 tacaatgttt aaaataagtg atgaaaatag 

aaaatgttca atattttacc cttaaatatg 
tttaccctgt tgcgcagatt ttttaaaatc 

15 

atctcatgct ttcatcttta tcgtattatc 
accctgtttt aagcccttta gaaacaatac 
20 ttgaatttag tgaatacaga tctttaatcc 

caggggattc aaaaataata aaatatgtac 
aactctacat ttacggagaa atttctctgt 

25 

atttggtctt tagtcttagc caccctgaag 
agcatagatc agagaggaaa gcacaggacc 
30 ccaagtaaac agctctacga tactggtctg 

tcaattctaa aagaaggata ataacttcat 
ttggaacatt gtctagcatc tggtaatcac 

35 

aaatcatagc acttaagagc tttaataaac 
ctttttaagt ggaagaataa attaaaattc 
40 atatagaaat ctcttccaaa gtaccattta 

aaaatatagg tgtgaaaaac taattagaag 
atgtgtagat tctcaatcta ggcctgctat 

45 

ttaatcactg agttctgatc cacttcatca 
attatctcag aaaggtgtaa ttcatggagt 
50 taacaacctc ctaaaaatta taataataaa 

tatcagatgt ggtcacttag ctgtttaaaa 
gaaagtccaa atttcctagc ctagcatacc 

55 

taccatctgc catcacacca tgctatatca 
ctgctgcttc tacctggatt ctgactcctg 
60 taattcctac tcagctgtta agatatagat 
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gttcattccc 


acatatagaa 


atgcaattta 


47520 


taaactaact 


tattagttcc 


catagctttt 


47580 


agataaccat 


tttaagtatt 


aataaagaca 


47640 


tcttgcccta 


ttacactgtg 


tagtctccaa 


47700 


caatctttcc 


ttgttcttga 


tcttagggtg 


47760 


atgttttttg 


taggtttttt 


acagagtaaa 


47820 


atgaatgggt 


attggctttt 


gtcatttttc 


47880 


tgtatattac 


acaactagag 


gagagtacaa 


47940 


taaaataagt 


tacactaaat 


aactcttcaa 


48000 


actaatggca 


cagtaatgtg 


caaagtacaa 


48060 


cttgatctca 


agaaaaggaa 


agggaaagtg 


48120 


gccagtttct 


gtgcttgaaa 


gtattatttc 


48180 


taggaagtag 


gtgacgttat 


taatacaaga 


48240 


tcagattcaa 


ctacaatttt 


taaatccttc 


48300 


gttatttaac 


atctctgaac 


ctcaattccc 


48360 


agggatgtca 


taccaatttt 


tgtgatgact 


48420 


tgagttctca 


ttaatggatg 


aggaaattta 


48480 


agagaaacta 


taattatttc 


ttccaaacct 


48540 


acatgaaaca 


aagtttgagg 


attcttagaa 


48600 


tactcacata 


aatagtcctt 


caatagggat 


48660 


aatgggaaat 


gcaatgtatt 


atttcagaga 


48720 


tagcaaacga 


ggtaagcata 


attaattcat 


48780 


aatagaacaa 


gtagtctatt 


ctgactatga 


4 8840 


cagaataaaa 


agcagcaggt 


tgtgctgcag 


48900 


atgagaacta 


atgggaagca 


ttgcaggttg 


48960 


aactccaatt 


gcttcctctt 


atttgtaaag 


49020 


atctaataat 


atcagtaata 


tctaactgct 


49080 


caaccttgtg 


tttttgtacc 


cgctattcca 


49140 


tcttatttca 


tcttaccgtc 


tatgcctgag 


49200 


taaatattgc 


ctcttctgta 


aagcatttcc 


49260 
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aggttgtctc agaattaggt catcttctat 
gtaattgcat ttgctgaatt gtattactat 
5 tgtgagcacc ttaagggaaa atcccacatc 

atctgggtga tcaataaact attatagatt 
atataaaatg tctcagaagc tttataatat 

10 

taatcaaatt cctctgttca atatttgaaa 
ccaataagtt aatggtaaag gcaggactgg 
15 tatttcctat ctactgtctt aagtcagaaa 

tcagacatga ttctgtcata ttaattaccc 
ttgccttcgg atgttactca aaatagtgaa 

20 

caatttgtgc ttcaagaaga gttttccagt 
atctttggag gctatgcttt cttcaagtta 
25 ccaggtaata aaggtagaga atttgtgcca 
cccaaatttt cagaggaaga aattgaacct 
gaaaatatgt ccattcagct agactctaca 

30 

tctgctattg tacaaccttc tcaaatgcaa 
ctccaagttt ttccatccca ttctgcacta 
35 ccatcccaag ctctaccagt agaaggcctg 

tcatcccatg tcaaacagtc ttctaatctg 
ctatcccaag acacatcatc tcaagatatg 

40 

caatccctag atatgctatc tcaagacaca 
ccttcccaag atatgctatc ccaagctcta 
45 acatcccata ttgtgcagtt ccctgaaata 

caaccagaaa acactgaacc tcaaaaccag 
agatcagaag ttatggaaga gaccaaagaa 

50 

aaatcctcaa gacggcattc cttaaaccag 
cattctttag acgtgcaagc caaaggccag 
55 caaagcaaag gctggcaatc tccaaagcag 

ctatccccaa agaggcactc cgtagataag 
ctcccagatc agcaagctga agatcagcaa 

60 

tctaaagatg gacaagttaa agaccagcag 
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gttcccataa 


agcactaagt 


acctatttat 


49320 


tatctattta 


tatgttgatg 


tacagatagt 


49380 


taattgtctt 


tgattcccag 


catttagttc 


49440 


aagcatgaat 


atatgaatat 


tgactagatg 


49500 


taagtatggg 


aggaattttt 


gaggcaatct 


49560 


gactaaaagg 


tgatgacttc 


taaaataata 


49620 


cattcgtttt 


ctgccttctc 


atttagatgt 


49680 


tgaaaagtaa 


cttggaaaac 


aaaaaaaaaa 


49740 


tcatccttgc 


ttttataggt 


tctgtttttc 


49800 


caacctgccc 


cagaagaaaa 


tgatcaatta 


49860 


gatgattcaa 


caacaaatgc 


acaatctgtt 


49920 


acactctcta 


ggagtccttt 


agtctcccaa 


49980 


gatgaacaaa 


agcaaagtat 


ccttccatct 


50040 


ttgcctccca 


cactagagaa 


aaagccctca 


50100 


tttaaacaaa 


tgaaagatga 


agatctacaa 


50160 


accaagcttc 


tgcaggacca 


agctgcgtca 


50220 


aaactcgaag 


atatatcacc 


tgaagacttg 


50280 


tcagaacaaa 


ccatgccatc 


taagtctaca 


50340 


acagctaatg 


acctgccccc 


tcaaggcata 


50400 


ctgtttcatg 


acatgacatc 


ccaagatatg 


50460 


ccatcccacg 


ccatgccacc 


tcaagacata 


50520 


tcagcgcatg 


ccatattacc 


tgaagcctca 


50580 


caacacctac 


ttcagcagcc 


cccagatctt 


50640 


caaattttac 


aaatgtcata 


tcaagatatt 


50700 


tggaaatctg 


aggaggaact 


ccatagaaga 


50760 


caaaccaaag 


ccttgcaata 


cttaaggaga 


50820 


aaatcctcaa 


agaggcattc 


cttagatcag 


50880 


aaatccttag 


accagcaaat 


caaagactgg 


50940 


caagctcaac 


ttaatcaaac 


taaagagcaa 


51000 


gccaaagggg 


aacaataccc 


agaaggacaa 


51060 


actgataagg 


agcaaaactc 


aaagaagcaa 


51120 
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acccaggatc agcaaactga agaccagccg gcccaagaga agaaatcccc gaaaggacaa 51180 

ttccaaaatg ttcaagccga aggacagcaa gctcaggtgg agaaagtgcc aaaactgtta 51240 

5 

tgccaagatt cagaatccca aatacagcaa taccaattct ggcaattcca caaaggcaat 51300 

ctccaggctg gacaacccag gactgtcaat cttttggcca agaatcccct gactggataa . 51360 

10 ctcagggctg gagaaacaaa gattataaag cacgagaatg gcaatttgaa atgaagcact 5142 0 

ggcaaacaca ggatctatta gagaaagaag ccctaaagca gaaagctcta taccaagaag 51480 

tccaaaccca gcacgcaaca gcccaacata acctagaatg tcaagacact caagataaag 51540 

15 

accaacaaga ccttcaatcc agagttacac aaaaaggaga tatgtacact agagacatca 51600 

aaccagggga catgaaatgt atagggcaaa cctcagggga cctgcaatca gaagacgtga 51660 

20 aggcagattt tcattcttct tctggccaaa gctcagtaca agacacatgt ttagcctatt 51720 

tgtccaatct agattcagaa caagatgtgc aaccagacac ttcagcttcc tcaaattcat 51780 

ataaagaaga tgtgaattta acttctactt catgtgatcc aaaagatcaa cagcaatctg 51840 

25 

aagactctga ctaacatgca gaatctaccc aataccacac tgcccccatt aatggaatta 51900 

aattgggaaa aacaatattg cctcctccaa tctgtgttct caactgtggt tgccacctca 51960 

30 ttaacttaca aaaaaatgaa gggcatgctg agcactcaaa caatttgttc ttacttaaaa 52 020 

taaaatgaca acaaaccaaa tgttaacact gtatcatcac tttatgtatg tgaagaaata 52080 

attcacatgt atattccttg tcatcaaagt tagtgtttca gtttataatc acgatttcat 52140 

35 

tttataactt aatgtttaat ggccctactt tggtatgatg ctagtatttt tcatctcccc 52200 

cttttggcat tataggagtt cagaaaatca accacaggga agtctatgaa ggctctctaa 52260 

40 gggttcaaga accaaaagag tcagcaagga acaaaaattt ctcccatttc cctgtcttgc 52320 

tgtttccaat cctcttattt actctccctc ctcatattta tgttgaggat atatggactg 52380 

agtttgctca attattaatg aggtagaatg tggcaatgaa tacagaatgt gatgatggag 52440 

45 

tgggagggat taagagatat ctagagacag gaaagtagaa attagggaag aacagagatg 52 500 

cagatgtcat aatagaaaat atgacatact gtattgtatt cagtgaagct tcagctccct 52560 

50 tctttatgcc tgcatttact gattataatt tctcaaccct tatgatacat aatttcatcc 52620 

tctgtcatca aaggaaattg tgttatcaaa agctatcagt tacctcaatc ttccaagcct 52680 

aatcgtgttt tttcaatcct ctttcccttt atctctgata catttggaaa ggttaacaat 52740 

55 

cccttttctt tcaattttta tcacccccat taacttctgt gttttgcact tcctgtttca 52800 

aaaattctat aataggtatt tctagggatc tctccccaac tttattcatt ttttgtaatg 52 860 

60 acaattcact attacagttc tctcaccttc ttgtgaataa tccagactgt gtcatatttg 52920 
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10 



tgtcttctac agaatttcag cacaaaggct tgattcaaga atacattcaa taaaacattt 52980 

ttaagggcat aaatgggagc ttggagtctc tacttagagg tgaacagaga agttatatgt 53 04 0 

agtagtataa taagatacag agctccttat gactggctac atgttccttg ctgcctgtgc 53100 

ctttgtagta aaatattagc ttcctgagct tcacatgatt gcttgacctt gatagagtcc 53160 

ttacctgagt tagcctttag ttcctgaatg accaccacct ggtattttaa cttactctga 53220 

ataatactaa taatttctac cttttccttt agctagatta gcacgtttct acagtgctta 53280 

tgtagagttt aactttcagt ctacaaactt gggtgtttag gaagtatata tatatatata 53340 

15 tatatatata tatatatgta tgtatatata tatatatata tatgtatgta tatatatata 53400 

tatagagaga gagagagaga gtgtatatat atatatatat atacacacac acacacaatt 53460 

ttgtatcaaa ttctttgaga atccagccca cacagttact cttgccagtt catccctata 53520 

20 

tctacccaac atacacaaat acatttctgc agaaatgtac aaagatcctg gaatcttaaa 53580 

tagtgccaag tcactggaga ggcaatttcc atcatcagcg actagtccac accagttacc 53640 

25 actgagaaac ttataggcca agttcaaata taccgtatgt gtcctgtgaa cttgttgctt 53700 

ctatgctaca tttggaacat ggaaattttt ccaaaattgc aagaatacaa cgcctatagg 53 760 

ttagacttac aatatgagaa tggaagaatg taaaaaacca cttcttctga ggcctttcca 53 82 0 

30 

tcccactgag ggagatgcta aggggcaagt cagaagctcc caaggccccc agaacccata 53 880 

gtaacctttc tagtctacag aagactctct tttctctctg tttcttgaag gcattttccc 53940 

35 ttgacctccc ctttctgaga cacctaatac tgtcctatca atgagtttgg gtctgaagtt 54000 

gacaaaatca aatgtgttta agacccataa ggaaaaataa tgtttgaagt atctgtttaa 54060 

aagacaaaat gcaaaaggct gtgggaaact agaaaattta tgatttgctt aaaggcaacc 54120 

40 

taattttttt attaaaaact gtcttattac aaatataata tgtgtttcct agaatatgtc 54180 

aggcttcttt gtagtaaaat tgggaaaagt tgatttgaga tactgctacc ttacacactc 5424 0 

45 agttcccaag aactgtggaa aaaaaaataa gctggctgct tacaattgca ttttcattta 54300 

ctagaatcaa caacctacag ctaaattata cttcccagcc accttgcatc taggtaggtg 54360 

agatggtaac tattgccaat ggaatgtaaa gtttgactcc taaaatcacc aacattatat 5442 0 

50 

ctacaatatt tttttccctc atgcactggc tagatgcaga gaatctagga tccacaagac 54480 

cctgagtctc tagagaatgg agcctggtcc ataaatcgtg gtgtggatta cttccagttt 5454 0 

55 gaatattcaa ctggattgtg atctaaaaga aaaacccata ttagggcaag gaatgcatgg 54600 

agctgtttgg ctagaaggta gtctaagctg agtaatagag tctttaaggc acaaaaaagt 54660 

caaatcttgc tcaaaatggc aggactccta ctcgtatctc cttgtgttag ttacctattg 54720 

ctgcataaca aattaccaca aaggtagctg cttaaaacaa caagcttatt atctctcagt 54780 



60 
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tcagaagccc 


aggaaagctt 


ggttaaatta 


tttgtacaga 


atatcacatg 


gctaaaatca 


54 84 0 


5 


gaggactggc 


cagggcagtg 


gttctggggt 


tgagctcctc 


ttcaaagttc 


atccattgtt 


54 90 0 


ggaagaatcc 


agtcctcacg 


gctgcaggac 


tgaggttccc 


atttccttgc 


tggctgtcaa 


54 96 0 




ctgggaactg 


ctcctgcatc 


ccttctcatg 


tgatcccttc 


catcttcaac 


caggtaacag 


5502 0 


10 


tgtgtcaagt 


cttctcaaac 


ccaaatctga 


ctccctctcc 


tgcatctggt 


tttaaggagt 


5508 0 




ccttttatta 


cactagactt 


actggaataa 


tccaagataa 


cctattttag 


gataaattga 


5514 0 


15 


ttaataacct 


taaatttctg 


tgcaaagttc 


cttttctcat 


tgaagataac 


cacaattagg 


552 0 0 


aaagattatg 


ggctcccata 


aatagtcaca 


tggaaaaata 


cacacaattt 


taaaaattaa 


55260 




tgttttccta 


atttcttaga 


aacatgtttg 


acttttaaag 


caggaactaa 


ctgaattggt 


5532 0 


20 


gttttaaaga 


gattactgtg 


gctgcagttt 


ggaggatgaa 


ctggagaaag 


aaagatggca 


553 8 0 




gcagggagtc 


aattaggaaa 


ctcttaaaat 


agtgaagaaa 


gaggtgctaa 


gtgatgtagg 


5544 0 


25 


gatgtataga 


agagtagaga 


ctaaaatgat 


aattcaagag 


taaaattgtc 


aggatgggac 


55500 


agaaactaga 


ggtgtagaat 


tgaagaaaat 


gaagaataaa 


ccatctgctc 


tagcttaaaa 


55560 




gattagctgg 


ctgagacatc 


attaccatca 


acaaaacaga 


cgaagaggca 


caggttgcag 


5562 0 


30 


caagagtaag 


gggcatagtc 


agttctttct 


agtgtagtgt 


attaggcagg 


gttctccaga 


55680 




gaaacagaac 


caagaggatt 


agacagatag 


acagagagct 


atagatagat 


atatagatga 


55740 


35 


tagatatata 


gaagatagat 


agatagatag 


atgatagata 


gatagataga 


tagattgata 


55800 


gatagataga 


tagatagata 


gatagataga 


tagatagata 


aggaattggc 


tcacatgatt 


55860 




ataaggctga 


gaagtcccaa 


gatctgaaag 


ctggagaccc 


aggagagctg 


atggtatagt 


55920 


40 


tccagcctga 


gtacgaaggc 


ctcagaacca 


agagtttaaa 


ttccagttca 


agtcttagtc 


55980 




tgaaggtagg 


aaaagaccag 


tgtcccagct 


caaagacagg 


cagagaaagc 


aaattctctc 


56040 


45 


ttgctcaggc 


tttggtttta 


tttaagccct 


cattggagac 


attgagagta 


acactgggtt 


56100 


ttaccaccgt 


tcatggaata 


agatttaaag 


acactattgt 


tatttataat 


taaaaaaaaa 


56160 




aaagaaaact 


agaggatctg 


gagactccta 


agcaggtgac 


atcttgccac 


aacagttaac 


56220 


50 


tgctctagct 


ctgctcccag 


agttgttctc 


tccagagtat 


ggatatggat 


ctttttaaag 


562 80 




gaaggcttta 


acagcagaga 


aaaataagag 


aaacttagaa 


acaaagaaaa 


ttgactttca 


56340 


55 


gctgcagtaa 


cagaagaatg 


agagattaaa 


tgaggcccac 


ccatattggg 


gagggcatct 


56400 


gctttattca 


gtttaccaat 


gtaaaaacct 


catccagact 


catcctcata 


gacacactaa 


56460 




gaataaagtt 


taaccaaaag 


tcttggcacc 


tattgttcaa 


tcaagatgat 


gcataattta 


56520 


60 


accatcaaaa 


ggccatcctt 


tgtgaacctg 


acactcatat 


gcatctcctt 


aagccatact 


56580 
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taatctccaa ataaagacta aaaggtgata 
atacaactgg aaatgtacca acctcttctc 
5 ttactcttct ccttactgtc tagcaacttc 

caatactata atataaacat gccttatgtt 
gtgtgtgtgt gtgtgtgtgt gtgtagtttc 

10 

ttataactac cttattctac tacccattct 
gctggccatg gttctttacc tggtagaatg 
15 caattagtgg tcctgcctgg actgagttgt 

atcatggtaa tactaagaga tgcatgaagc 
ttatgtccat tgtggaatag tggtctaatt 

20 

tccagcacag taaccccttt cagttatctg 
cagcaataaa cctataggag cctaccttcc 
25 attgacactt caagcaccat tggatctgtt 
cacggaagcc gaggcctgtt gtgaagcttt 
aagtcacttg gcaaattggc cagagcaaca 

30 

atccaaagaa actcattaga tgctgtgcct 
cttatcctca tgctggaaga gattctcaat 
35 tcactgaggt acaagaccct gaggttttat 

atatcttatg aataaatcta gagtagttac 
aatgtcatca acctaatgga ccagtgtgat 

40 

cctgtgaact aaattatgac atagagctat 
gaatgtgtat tgctggcctt gtcatctgaa 
45 agctgtccag aaaaagcatt tatcagatca 

ttaatttgct caagtaatga aaccacattt 
aattggtaat ccactgtcat cttccaagat 

50 

aagttgaata ggaatgtggt gggaatcacc 
tcattaatct ctgtagcccc tccaggaatg 
55 agttactgat ggatttatta atttctgcag 

ttactatttt cctaggcaga ggcagtacta 
ggtccttact ccacagagta agaaaataat 

60 

gttttaatta tgcattctgg aattggggga 
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attcttcata 


atatgttaca 


actatcatgt 


56640 


agaagaaggg 


gtaaagtctg 


tgaatgatgt 


56700 


tatgatttaa 


attgtatgat 


gtaaagttaa 


56760 


acatgattag 


agaatgagaa 


agaacagatt 


56820 


tataactgga 


tatgtggacg 


taactcatat 


56880 


gtagtccctt 


tgccttcagc 


aaacacctta 


56940 


acccaaacct 


tcattcctga 


aaagcctgga 


57000 


tgtggttttc 


ttaattgacc 


ttaatcacag 


57060 


gatctcctat 


attccaagca 


tagtgttccc 


57120 


tccccttagt 


catcagaaac 


aatcacccca 


57180 


aagaggatgg 


aagggctttg 


ctccaaaatt 


57240 


cagaagctct 


atatagcatc 


cctatctgcc 


57300 


gatggtatgg 


tccaagcaga 


agtcagctta 


57360 


tcttctggtc 


ctactcaaaa 


gcagctctgt 


57420 


cacccaagtg 


aggaatatgt 


tgcctccaaa 


57480 


cctattcatt 


gtggaagcca 


gatgcgacaa 


57540 


attccccaca 


ccacttgatc 


cctagaaatt 


57600 


cagatttatt 


ttccacactc 


tggcatgtaa 


57660 


tacgtcttat 


tcgctaggtc 


caatcagcat 


57720 


atcttgtgga 


aggaaagggc 


aattaagatc 


57780 


aaagttgata 


tacccccgag 


gtatgacagt 


57840 


agcaaactgt 


tcctggtggt 


ctttactaat 


57900 


ataacattat 


actgggtact 


agaggatgtg 


57960 


ggtaaggcag 


ttgcaattag 


agtcacctcc 


58020 


ccatctgtct 


tctgcacatg 


ccaaacatac 


58080 


acccctgcaa 


ctttcaagtt 


actgatggtg 


58140 


tgttgggagt 


caccaaccct 


gcatctttta 


58200 


taccatcagg 


aatgtggtga 


tgcttttgct 


58260 


gaacttccac 


ttggtattcc 


ccaccataaa 


58320 


atgagatcct 


accacctatt 


gagtatattt 


58380 


ataaccgcag 


gatgggtttg 


ggagcccact 


58440 
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gcacccattg 


tgagattaac 


cagagataaa 


5 


cttactctga 


gtggtaagcc 


acagtgatgc 


gaatcagtgt 


ctagtagtcc 


ctgagaggtc 


* 


ccctggaaaa 


aggctgcagg 


tccctttgag 


10 






rrt"tccttaa 




gttctttctg 


ggtttgtaaa 


ttttattgca 


15 


gattgagggg 


ctatgagtct 


atatttttat 




agaacttttc 


tgcttatata 


gatcaagggg 




tctaggaaca 


ccataatcaa 


ctagcccaca 


20 






U. w LA t* V^> l_J. \w* C— *. <J 




cagtctatga 


cctggtcaag 


ttgacacata 


25 


cagtcttggg 


gccacaatct 


ggaatgagtc 




agctgctggg 


atgcaaatgt 


atactaagca 




acaaaaatgt 


attattttaa 


aacatggatc 


30 


LCI L u ct Lyoo. 


ct v_» ^ l» v*. L^>a < — • - 


tataattatc 




tttctgaaag 


gctgttactg 


ttagtgaacc 


35 


cagatctcct 


ttgcagccat 


aatatcctcc 




ggaaattctc 


atctctcact 


ctgcttcccc 




tccctgtggt 


ccctgtgatt 


tttttttttt 


40 


^ o o t~ ot~ aa^ ci 
ololcl y _3 


t~ fhctaatca 

V— l_ V_. >w O ^-J ^ U- w t* 


aaaaattaaa 




tgtcaaggca 


aagctctgaa 


gcatcaaatt 


45 


ggatcataat 


tttgacccag 


caccattact 




tttccgtgcc 


taagttgagt 


agtactcagc 




agaatgtata 


gtgtttaaaa 


tatctgtcac 


50 


aaacatctat 


aaagctaatt 


cttttactag 




tctaatttat 


gccaccaaag 


tatctctctt 


55 


tattgagtga 


actgcatccc 


actctaaatt 


tatctggaga 


cgggcttttg 


aagctgttgg 




catgacggga 


ttagtgtcct 


tatatgaaaa 


60 


ctgtgtgtgt 


gtccttgtgt 


gtgtgctcac 
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actccatgga 


tcatcggacc 


tccacaacca 


58500 


tttgagtctc 


ctggaattag 


tatcaattca 


58560 


tggttatttc 


cttttcccca 


atgctcagta 


58620 


gaaggctgtg 


agaaagatca 


acagtacaaa 


58680 


ggagacctgg 


tctccccttt 


cattcaagga 


58740 


ggtctgtaaa 


ctggctcagg 


tctgggaatg 


58800 


gattcaggtt 


aggctttcat 


tcaatgacct 


58860 


taatttaaaa 


ggatttctgt 


ctattttacc 


58920 


cccacaccat 


aggtctgatt 


caaatattaa 


58980 


ccaaaataat 


gttttaatta 


agtatttgag 


59040 


aagttaacca 


tcgtattttg 


tttatactca 


59100 


ttagtcgaaa 


tcacctgcac 


agatcagagg 


59160 


agtactcatt 


gagatatagc 


attagcagga 


59220 


cagagatctt 


ctctggaaaa 


taataatcta 


59280 


atgcctagaa 


aggcatgcag 


gcccatcaca 


59340 


acacagacac 


aaagagtcct 


gaggaaagtg 


59400 


atcagttcac 


aaggtactct 


ttggacattt 


59460 


cactaacaca 


tacacacata 


cacctttctt 


59520 


tttttacttc 


tctgacctac 


cctaggattc 


59580 


gttttatgca 


tcactggaga 


aagctttagt 


59640 


gagtacatgg 


aatctaaagg 


taaagcaact 


59700 


agtgggatac 


tttgggaaag 


tttcttaaaa 


59760 


acagggttgt 


caagaagatt 


ggataattaa 


59820 


ggagtagatg 


ctcaattttg 


ttccataata 


59880 


tcggtattac 


ttcttatggc 


ataaacttga 


59940 


tattctgaat 


tatacaaatt 


atctccctgc 


60000 


catttgttga 


agttctagct 


cacagtgtga 


60060 


gtttagatga 


ggatatgaga 


gtgaatccct 


60120 


gacaccaggc 


ctctgtctcc 


tgctcttttt 


60180 


atgcctgtgt 


acacacaaag 


gaaagatgat 


60240 
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gtgaggacag aaataataca gtcatctgca 
agttagctga cactttgatc ttggaattcg 
5 tgctgtttaa atgaagcagt catggtaatt 

accttttctt cttgcaaatt gggaaaggga 
atgaaacaaa gaaacttttt gttctttaaa 

10 

tatctaatac ctatgtgtaa gtccgaaatt 
agtaggaaat ccctgttgct gcctttgtag 
15 ttaaagtatg ctatatatgc aagactgtgc 
aatgaaaacc agcaggacag tgcagtgaga 
atgcagagaa aactaagaat tcaatagatg 

20 

cgatagaagc cccagtggtt ccccttccct 
tggagacatt aagagtgaca ctgggtttta 
25 ctattattat ttataaactt taaaaacaaa 
cgggtgacat ctggccacaa cagttgactg 
caaagtatgg atatggatct ttttaaagga 

30 

acttaaaaac aaagaaaatt gactttcagc 
cagtgttttt cttggaactt tccagcaagg 
35 taatcccatt gactatggtt cagagaactt 

aatacttagc aataagaagc ttgaaattaa 
acatattata tattattcca taaaccacat 

40 



50 



aggttttggc 


agattggaaa 


ataagaagaa 


tgaacagata 


catgtcatca 


atgtataaag 


tggagttcca 


cagccaccct 


gaaatggaga 


ggcacggtgg 


ctcatgcctg 


taatcccagc 


aggtcaggag 


atcgagacca 


atctggctaa 


caaaaaaatt 


agccaggcat 


ggtggcgggc 


ggcaggggaa 


tggcgcaaac 


ccaggaggca 


gcactccagc 


ctgggcaaca 


gagcaagact 


tgcactgatt 


cagtgtgaca 


tttccagtat 


ctataaagaa 


tcaacaaaaa 


atgaactttg 


tatctgctca 


ccatgctaaa 


aatggcaata 
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agccaagaag 


agagctctca 


taaggaacca 


60300 


cagcctccaa 


aagtatgaga 


aataaatttg 


60360 


taccatagca 


ccctaagcag 


aataatatac 


60420 


gtgaaggcat 


aaaaatccca 


tctgaccaca 


60480 


cataaaattc 


ctaggtagtc 


acaaagggaa 


60540 


agtaattaga 


gtgtttatgt 


cggaagatac 


60600 


gttctgtaga 


atggtcttct 


aatcttattt 


60660 


tactactacc 


ttagaggagt 


gaagaatcag 


60720 


aaaacagtca 


tcacttacag 


aagcccagac 


60780 


cagaaaatca 


gagcttgaga 


ccactccaca 


60840 


cgtattcgct 


tgctccttcc 


aagaaatgtc 


60900 


ccattattca 


tggaataaga 


tttaaagaca 


60960 


agaaaactag 


aggatttagg 


gactcctaag 


61020 


ctctaattct 


gctcccagag 


ttgttctctc 


61080 


gggctttaac 


aacagagaaa 


aataagagaa 


61140 


tgcagtaaga 


gaagaacgat 


agaaagaata 


61200 


gaatcagacg 


agaggctgtg 


aggaaaggcc 


61260 


ggattcctca 


gatcatcata 


acttctctgg 


61320 


ggctgaattt 


cagaacatta 


atgaatacaa 


61380 


taagactaga 


gtaatttgca 


ttaaaagagt 


61440 


atatgaggat 


atggaaaatt 


tttgaagtag 


61500 


gggcctaggg 


catttattag 


caggaggcca 


61560 


aattgtaaaa 


taacgtctgt 


actggggccg 


61620 


actttgggag 


gctgaggcag 


gcagatcaca 


61680 


cacggtgaaa 


ccctgtctct 


actaaaaata 


61740 


acctgtaatc 


ccagctactc 


aggaggctga 


61800 


gaactttcag 


tgagccgaga 


tcgcaccact 


61860 


ccatctcaaa 


aaataaaata 


aaataatgtc 


61920 


tatagctaag 


ctcatagaaa 


cacatttaac 


61980 


gaaagagtgc 


aaacataaag 


caagacaaag 


62040 


ataagagaaa 


taatacttat 


tactcttatt 


62100 
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tttattacta 


ctacaattta 


cttagcatct 


5 


acaataactg 


tataacaatt 


tttattctca 


taaattaata 


gataacttag 


cctagacccc 




attctaggtt 


tctcttctcc 


aaagagtaag 


10 


taaagaataa 


taaaaaacaa 


agaacaataa 




agatagctat 


tataacactg 


tgctagtcat 


15 


gggaaggtat 


agagcaaagt 


taaaacctga 




gtatgttgca 


acatagtgtg 


cttaacttag 




atgaaggata 


atcaccatta 


tgattcaacg 


20 


gttcagagaa 


aagaaattgt 


qaqqqaqqqq 




ggtaagtggc 


tagaggctgt 


cacatcatga 


25 


ggaaacacca 


gaaaatgaca 


aggtacagtt 




tctttcttgt 


cctttcaaga 


tgaaagtaca 




ttttcttacc 


cctcgaatct 


gggttgtcta 


30 


acacatgact 


cagcagaggt 


ttgaaaagta 




ccaactgcca 


gagacatgac 


tgaggccatc 


35 


tgaccacagt 


tgcatgagac 


agcctggtag 




accacacaca 


ggacccacag 


atgtgtaaac 




acattttgag 


ggatttgtta 


cacagctaaa 


40 


gtgacaggaa 


aaatggatgt 


tatctatact 




gagagatttt 


gcgttctacc 


tgtgtctgaa 


45 


ttgccatttg 


gtaactaaaa 


tgtccttttt 


gctggagtgc 


agtggtgcaa 


tttcagctca 




ttctcatgcc 


tcagcctccc 


gagtagctgg 


50 


aattttttat 


atttttagta 


Q aQ a t qqqq t 




tcctgacctc 


aggtgatccg 


cccgcctcag 


55 


gccaccatgc 


gcggtcccta 


aaatgtcttt 




atgtatatat 


gaggaatcag 


aaaccaggag 




aaactcctaa 


ctaaaccagg 


actacttgtc 


60 


tacttatagc 


catcttggga 


tcttgagcag 



actacactcc aggtacttta aaatttactc 62160 

taaaggaaat gtaatatttt aataaatatg 6222 0 

ataaatagta aataaaaggg cttggggttc 62280 

tcttccactg catcaccaga ttgcatgaat 62340 

aaaattaaag ttatatttct gccatttgtc 62400 

catgcttaga gggagaaaaa acaaaaactg 62460 

gtaattaggg tgaaatggtg ttttataaaa 62520 

ggaatgctaa cacaacatca gatgatgttc 62580 

tccatagaaa actgaacaag agatggatac 62640 

tggcctttta aaatattcta actggcaact 62700 

ggttcaatta aggaagacct gatctcaata 62760 

aaaaacagac acaaattgtt cccttccctt 62820 

gatcctccca tgaagacatt agggggccta 62 88 0 

tgaaatttga ttggccaata tgacattaac 62 94 0 

tttgtgaatt ggaagagaac caagagtctg 63 00 0 

ctaattcatg taattggaca ctccaccagg 63060 

agaagtgctg agcagcctta gatgagacag 6312 0 

aaaagccgat ggtggttgct ggaactaact 6318 0 

gctacttaaa tcaggtggca aaaccacaaa 63240 

tcaagaagtg gttaatttgt tccgtggcct 63300 

gatatggaac tggaataaat gtgtatctag 63360 

tttgagacag agtcttgctc tgttgcccag 63420 

aggaaacctc cggctcctgg gttcaagtga 634 8 0 

gactacaggc acgtgccacc acacccggct 63 54 0 

ttcaccatgt tggccaggct ggtctctaac 63600 

cctcccaaag tgctgggatt acaggcgcga 63660 

atatgtagat catctagtcc aatttggtca 63 72 0 

agggggatgt gacttgccaa aagtcacagt 63 780 

aggttctctg atccccagtc ccaggcattt 63840 

tcctgagatt ctatgactct gtgactctat 63 900 
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ggttctataa ttccaagaca aagcaaacaa 
aaaagaacat ggtctagact gaagtaccaa 
5 caccatcatg gattcaagca ccgcacacag 

cactgcttca gaatatgagt ccacagaact 
cttgcaaaaa ttatttgcta gaaaaatgaa 

10 

tgtatatttt actgaggcag gggaaaggct 
ggtaggagcc tattccaatt cttctttaag 
15 tttgagagag gatgtttgag aagaacgaat 

atcatttagg actcacctac ttaaaaagtg 
cagactgaga aggtggactg agaagaagag 

20 

atattgtata ccttccattt ttgtatattg 
accactgctt ggtgtcacag tgcatcacat 
25 aaaaacccga agagatggag agtcctctgg 
tcagagaatt ggccatctcc aggcagggct 
ctggtatact catttagaat tcagtgtttc 

30 

tgaagaactt atgaagacac aggtttccca 
ggatagggcc tgatagttcg catttctagc 
35 ggaccacatt ttcaggagca ctgatttaga 
aggaattaag aaagtaccca ggagcatcat 
accaaagaga gtaattgcca aatccttttg 

40 

aactatcact ttgcttttcc tcaccatgac 
ctgtttggaa ttatgacctt ttcttttgga 
45 tatccaaggt ttccctttat atttctttca 

agtatagtca atcaagttca atttgaagcc 
caatgaaata agctagatcc agttgtatga 

50 

tgttgaaatt atttccccat tagttcatca 
aagttccttt gggaatactt cttttgaggc 
55 ttgaaattag ttactattta ttataagcca 

cccaaagacc tctcagtcta gcaagaaaac 
taaatgttat cacagaggtg caggaaatgc 

60 

ttctttacag aagtgaagtg ggaagggttt 
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ttccagtgct 


ccaggcagcc 


tcagcacaag 


63960 


ctaaatcatc 


tcctttcaaa 


ttatcaccga 


64020 


tccggtgttt 


ctggtatttc 


ctccagaaat 


64080 


ttcagccacg 


accttttcaa 


ctcaaagccc 


64140 


aatcttaggg 


gtaagtaaga 


cttgccccta 


64200 


agggaaatta 


tctgttggca 


catgtttgaa 


64260 


acaatgtgaa 


tatagagtgt 


tggtgggcca 


64320 


catgtgggtc 


tggacttgac 


aaaaaatgct 


64380 


atagttgcta 


tggcagtaaa 


tgagattact 


64440 


agcagagcac 


atagactcac 


acgtttttgt 


64500 


tcctcattag 


tctttcccag 


cacttgtcaa 


64560 


gagtggctca 


gagagggaga 


gaaaaaaaaa 


64620 


gcctgagagt 


gtgacaatac 


ctcaagaggc 


64680 


cctcacagtc 


caattctgct 


tgaggccagc 


64740 


tcagacattc 


acatttatgg 


gctaaagaac 


64800 


gcctcacctg 


gagaccgttc 


agtaagtatg 


64860 


tcccaagtga 


tcctgatgtt 


gccaatccat 


64920 


tcattttacc 


tatcaccaaa 


taccagatag 


64980 


aaagggactt 


ctattattct 


tcccctaatt 


6-5 04 0 


accaaaagaa 


gatattctaa 


acagaaggga 


65100 


tttttcctct 


atttgcagac 


tatccagatc 


65160 


gttatcttcc 


ttttcacttt 


gttaaaacca 


65220 


ggatatccat 


tctggggctc 


tgttttggtg 


65280 


atgccaacca 


ggatgttagg 


gaattcttca 


65340 


catgctttca 


aaagtgcaaa 


aaactaacct 


65400 


cagtcactga 


tattccatca 


ttaggtcttg 


65460 


ttgcagttgt 


ttatttggct 


attcattcat 


65520 


gatacacaga 


gaaaggcaat 


gggcctctcc 


65580 


atgtaaacag 


gtcacttcaa 


gtcagtgtga 


65640 


ttatgtaagc 


agagatgaag 


gagcaattta 


65700 


ctcagatata 


ccttttgagc 


tgcggagagg 


65760 
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tatttattaa acccagtgta gtcacaatca ataaacatta acttaggacc taatttagtg 65820 

ccagagaatg ggcagagcaa tggcttcagg gaaaagaaag cagtgcctga gctcaaggtc 65880 

5 

tcagttgagt gaggagtcag acaattaaca cagggtgtga tgagggagaa taaacacagg 65940 

ggctgcctcc ataggaaagg catcaacaaa acagagggga caacgcccag agacatttcc 66000 

10 agcttgacca agttttaaag ggtgaatggt taatcaggtg aggacacaca cacagaagat 6606 0 

gggagttggg atacccaagt gtaaggatta gggatgtctt gttgcatgcc aagggagaaa 6612 0 

tatgagcaca ggctgaaggt atgggagagc attttctatt atgtgaacta taagagttca 6618 0 

15 

gcacagggag tgaggatgga gagatgaggc tggaaaatac ccatgaaggg gctggaaatt 6624 0 

taaacatgga agtgctgtaa tcagatttga aatttccact agacaacaat gctaccacgt 66300 

20 gtgacccact tccaagaaaa atcttggaga ttttaagaac taccagcaca ttcactgcaa 66360 

ccccacctta tgttcccatc caagagaaca cactggattg cagtaagatg gaatccaatc 6642 0 

ttaatgatta taaactgact tgaaatccct ttgtcatgag atcaaagctg agtccacaat 66480 

25 

agctataaaa agaacatgga tgagtaaacc aaagccctgt acggttaaaa aaaatgttca 66540 

tgttcaaata aatagttagg gctaaaggag atttccagac cagtgttttg tccactattc 66600 

30 catttgtgga aagtcatata aagcaactca cccttctcac tgccctgagg agtgtgtgaa 66660 

cttagaaggc gggcctgtaa aagaaatgca tagaggtcta ttctgtaaga actcaacatc 6672 0 

agctaaacat ggtcatcatt aacatattta ttcctcttaa cagttcatta attctggagc 66780 

35 

cttcctaatt gcagtgaaaa gaaaaaccac agaaactctg gtgagttata ttcttacttt 66840 

attaaaaata tattttgtag ccagtcatgg tggctcatac ctgtaatccc agcactttgg 66900 

40 gaggccaaga cgggtggatc atgaggtcag gagttcaaga ccagcctggc caacatggtg 66960 

aaaccccatc tctactaaaa atacaaaatt agccaggtgt ggtggcacac acctgtagtc 6702 0 

ccagctactc aggaggctga ggcaggagaa ttgcttgtac ctgggaggtg gaggttgcaa 67080 

45 

ggagccgaga tcgtgcctgg gcaacggaaa agactccatc tcaaaaaaaa aaaacaaaaa 6714 0 

aaagtaatga gttacatttt cactatttcc actttattaa aaatatataa ctttcaaata 67200 

50 atttatcctg gccttgaaaa aaatttgtaa aatcattagg tcaccataat tccacaaact 67260 

atcaagtatt aataccaata aacagcattt gcttcctgtt ttcctagaat tataacccat 67320 

tatgctagag gagaaagttt ctctaggtca tctagtttag ctcttatttt atgtatgaag 67380 

55 

acactgaggc ttacaatcgt taaatagctt gtccaaattc ccagccaatt aatggtagaa 6744 0 

ttatttcaac attccagtta cttattcatt cttttttgtt ttctgggtat ttacggcata 67500 

60 ccaggcactt tgacaggcac tactttgact catacgaaag gcaaactgta gacaatctaa 67560 
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ttaaaccata aacagaatct caaaataaag 
ataaattata agtagaatat ttttacttta 
5 atgccaacaa aagattttag caaaggactg 

attaaaaatg gcagagaagt gcagcattca 
taaacagacc atgactttca ccagaaaatc 

10 

tcccaactga ataagagtta acttaggcat 
gttcacaaga tccccttatt gtgtagatga 
15 cgcacaggat cacagagctg cctcgtggca 

cattctccgc actcattgct ttgtttgtag 
ttcttattct atttcagata atattgagcc 

20 

caatagctgg aatcattctc ctcacatttg 
gttattctca ccaaaatagt cagtgtaagg 
25 tgcatttata gagtgatgag taaagggctt 

caccattcag atcatgttgt ggacacatgt 
tccataatat ttgatagaca ttacaccagg 

30 

aactcccacc tttaggaatt cacaggagca 
atacaaatat aaatctaaaa aaaaaaggta 
35 atgtacatta gaatcacagt ggtgtgtttt 

aacttccgat aaagcatgat aagattgaaa 
accactaaga ttagtgtaaa agaatgtaaa 

40 

gttaccaaac acacacaaaa attgctgagg 
agggggaagt catattagaa gatatttctt 
45 gaatgcacca ggtccacaga aagcaggact 

gtaatggtta agtatggcta tgtggattaa 
aaaaaacaaa acaaccttaa agacattaaa 

50 

caatttttag atgaaagcct gcaacctgaa 
agctacttat gccttcagac ccatttgcca 
55 ccatgtggac tgaaagagaa agaagtcaag 

aagagagatt ttctccgtat taaactaggt 
gggcaaaaag atatacaacg taccctgccc 

60 

aaaaggaaaa gtatttgaga agtgtggcca 



PCT/US01/48437 



aatacagtgc 


tactgaatgt 


aaagttatag 


67620 


ctcaaagaat 


tgaagacata 


agatattact 


67680 


aaaatcctta 


gtttgtcatt 


ttagggagaa 


67740 


gttaaactta 


ggagtcttaa 


aataaaaatg 


67800 


aaaatgcaat 


tgttacaggc 


acacccttcc 


67860 


agaatatggg 


gaatggaaag 


gttactaaaa 


67920 


aggcctgaag 


cccaaagatg 


tgaagcaatt 


67980 


ttaagatgct 


gtgactaaaa 


tcctgctctc 


68040 


tacaggctgg 


acctcagtca 


cattgttatg 


68100 


gaataatgaa 


ttttcttagt 


gccctgggag 


68160 


gtttcatcct 


agatcaaaac 


tacatttgtg 


68220 


ctgttactgt 


cctgttcttg 


gtaagtatgt 


68280 


aatagaaaat 


atttattagc 


actcaatcag 


68340 


attttctttt 


actgacaaac 


atttatgaaa 


68400 


cattgtggaa 


gagaaaaata 


aacacattat 


68460 


caagagataa 


aacatacaca 


aatcagtatg 


68520 


cagcagtatg 


gctgtagttt 


atagccctat 


68580 


tacagttctt 


gatttggttt 


taacaggctg 


68640 


aggggcattt 


attttagctg 


ccttcaaaat 


68700 


aggtttgatc 


cacatggaga 


gaaaaagaga 


68760 


aaattgaaaa 


cccaccatct 


aaggcaaagc 


68820 


acagcagagc 


ctaagaaata 


cttaatattt 


68880 


aaagagcaga 


aaatagggag 


actgattttg 


68940 


ctcttctctg 


aaaacagaag 


tctttttatt 


69000 


aagctgaaag 


gtcagctgga 


actactagat 


69060 


atttacaagt 


ggaactctga 


agagcatgaa 


69120 


gtgatgctta 


tttggaatcc 


aatccctggc 


69180 


gtttcacatc 


cactcaaagt 


ataaagactt 


69240 


ctttgatagg 


ctactacgct 


cttcaggtaa 


69300 


caaaatagca 


aagtcgctga 


acaaagttta 


69360 


caagccaaac 


aaaccccttg 


acaacctcct 


69420 
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tgcacgcccc aatccccact gacctttgct 
tacctgaatt tggtcttggt atgtggtgcc 

5 

ctatctagga gaaaaacatt cattaatctt 
attaacagta agcactcaaa agaggcctaa 
1 0 atagaaacag aacatagcaa aaaaaaaatc 

gaagagattt taaaaccaca atatttatca 
ccaagatggc caaataggaa cagctccggt 

15 

acgggtgatt tctgcatttc catctgaggt 
cagtgggcgc aggtcagtgg gtgcgtgcac 
20 gcctcacttg ggaagcgcaa ggggtcaggg 

acggacggca cctagaaaat cgggtcactc 
ttaaaaaacg gcgcaccacg agattatatc 

25 

cggagtctcg ctgattgcta gcacagcagt 
ctgggggagg ggcgcccgcc attgcccagg 
30 ctccaactgg gtggatccca cgacagctca 
ctctgggggc agggcacaga caaacaaaaa 
ccctgtctga cagctttgaa gagagcagtg 

35 

agactgcctc ctcaagtggg tccctgaccc 
ccccccagca ggggcacact gacacctcac 
40 gagggtcctc tctgttagaa ggaaaactaa 
catctgtaca tcaccatcat caaagaccaa 
aacagaacag aaaaactgga aactctaaaa 

45 

gcagttcctc accagcaacg gaacaaagct 
gaggcttcag acgatcaaat tactctgagc 
50 aggttgaaaa ctttgaaaaa aatttagaag 

agtgcttaaa ggagctgatg gagctgaaaa 
gaagcctcag gagtcgatgc gatcaactgg 

55 

tgaatgaaat gaagcgagaa gggaagttta 
gcctccaaga aatatgggac tatgtgaaaa 
60 aaagtgatgg ggagaatgga accaagttgg 
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aggcataaac 


tggataaggc 


aagaatgttt 


69480 


cctggtacct 


ggcagaagca 


aaaccgaagt 


69540 


aataaattcc 


catcaaattt 


tccattgaaa 


69600 


aaagaaataa 


gacaatgcga 


ataacaacta 


69660 


acctgtaaat 


ttctgatact 


aagtatatta 


69720 


tgttttaaga 


aataaatgag 


gggggaggag 


69780 


ctacagctcc 


cagcgtgaga 


gacgcagaag 


69840 


acccggttca 


tctcactagg 


gagtgccaga 


69900 


cctgcgcgag 


ccgaagcagg 


gcgaggcatt 


69960 


agttcccttt 


ctgagtcaaa 


gaaaggggtg 


70020 


ccacccgaat 


actgcgcttt 


tccgacgggc 


70080 


ccgcacctgg 


ctcggagggt 


cctatgccca 


70140 


ctgagatcaa 


actgcaaggc 


ggcagcaagg 


70200 


cttgcttagg 


tatacaaagc 


agctgggaag 


70260 


aggaggcctg 


cctgcctctg 


taggctccac 


70320 


gacagcagta 


acctctgcag 


acttaaatgt 


70380 


gttctcccag 


ctggagatct 


gagaacgggc 


70440 


ctgacccccg 


agcagcctaa 


ctgggaggca 


70500 


acggcagagt 


actccaacag 


acctgcagct 


70560 


caaacagaaa 


ggacatccac 


accaaaaacc 


70620 


aagtagataa 


aaccacaaag 


atggggaaaa 


70680 


agcagagtgc 


cactcctcct 


ccaaaggaat 


70740 


ggatggagaa 


tgactttgac 


gagttgagag 


70800 


tatgggagga 


cattcaaacc 


aaaggcaaag 


70860 


aatgtataac 


tagaataacc 


aatacagaga 


70920 


ccaaggctcg 


agaactacgt 


gaagaacgca 


70980 


aagaaagggt 


atcagcgatg 


gaagatgaaa 


71040 


gagaaaaaag 


aataaaaaga 


aatgagcaaa 


71100 


gaccaactct 


acatctgatt 


ggtgtaccta 


71160 


aaaacactct 


gcaggatatt 


atccaggaga 


71220 
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acttccccaa tctagcaagg caggccaacg 
aaagatactc ctcgagaaaa gcaactccaa 
5 aaatgaagga aaaaatgtta agggcagcca 
agctcatcag actaacagtg gatctctcgg 
ggccaatatt caacattctg aaagaaaaga 

10 

aactaagctt cataagtgaa ggagaaataa 
attttgtcac caccaggcct gccttacaag 
15 ggaacaactg gtaccagccg ctgcaaaatc 

gaagaaactg catcaactaa cgagcaaaat 
attcacacat aacaatatta actttaaatg 

20 

acagactggc aaattggata aagagtcaag 
atctcatgtg cagagacaca cataggctca 
25 caaatggaaa acaaaaaaag gcaggggttg 

aatcaacaaa gatcaaaaga gacaaagaag 
aacaagaaga gctaactatc ctgaatatat 

30 

taaagcaagt cctgagtgac ctacaaagag 
actttaacac cccactgtca acattagaca 
35 cccaggaatt gaactcagct ctgcaccaag 

acgccaaatc aacagaatat acattttttt 
accacatatt gggaagtaaa gctctcctca 

40 

actatctctc agaccacagt gcaatcaaac 
actgctcaac tacatggaaa cagaacaacc 
45 aaatgaaggc agaaataaag atgttctttg 
agaatctctg ggatgcattc aaagcaggat 
acaagagaaa gcaggaaaga tccaaaattg 

50 

acaagcaaga gcaaacacat tcaaaagcta 
cagaactgaa ggaaatagag acacaaaaaa 
55 ggttttttga aaggatcaac aaaattgata 

gagagaagaa tcaaatagac gcaataaaaa 
cagaaataca aacgaccatc agagaatact 

60 

atctagaaga aatggataaa ttcctcgaca 



ttcagattca ggaaatacag agaacgccac 71280 

gacacataat tgtgagattc accaaagtgg 71340 

gagagaaagg ttgggtaacc ctcaaaggga 71400 

cagaaaccct acaagccaga agagagtggg 71460 

attttcaacc cagaatttca tatccagcca 71520 

aatactttac agacaagcaa atgctgagag 71580 

agctcctgaa ggaagcacta aacatggaaa 71640 

atgccaaaat gtaaagacca tcgagactag 71700 

aaccagctaa catcataatg acaggatcaa 71760 

taaatggact aaatgctcca attaatagac 71820 

acccatcagt gtgctgtatt caggaaatgc 71880 

aaataaaagg atggaggaag ttctaccaag 71940 

caatcctagt ctctgataaa acagacttta 72000 

gccattacat aatggtaaag ggatcaattc 72060 

atgcacccaa tacaggagca ccaagattca 72120 

acttagactc ccacacatta ataatgggag 72180 

gatcaacgag acagaaagtc aacaaggata 72240 

cagacctaat agacatctac agaactctcc 72300 

cagcaccaca ccacacctat tccaaaattg 72360 

gcaaatgtaa aagaacagaa attataacaa 72420 

tagaactcag gattaagaat ctcactcaaa 72480 

tgctcctgaa tgactactgg gtacataatg 72540 

aaaccaatga gaacaaagac acaacatatc 72600 

gtagagggaa atttaaagca ctaaatgccc 72660 

acaccctaac atcacaatta aaagaactag 72720 

gcagaagcca agaaataact aaaatcagag 72780 

ctcttcaaaa aattaatgaa tccaggagct 72840 

gactgctagc aagactaata aagaaaaaaa 72900 

atgataaagg ggatgtcacc accgatccca 72960 

acaaacacct ctacgcaaat aaactagaaa 73020 

catacactgt cccaagacta aaccaggaag 73080 
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aagttgaatc tctgaataga ccaataacag gagctgaaat tgtggcaata atcaatagct 73140 

taccaaccaa aaagagtcca ggaccagatg gattcacagc cgaattatac cagaggtaca 73200 

5 

aggaggaact ggtaccattc cttctgaaac tattccaatc aatagaaaaa gagggaatcc 73260 

tccctaactc attttatgag gccagcatca ttctgatacc aaagccgggc agagacacaa 73320 

10 ccaaaaaaga gaattttaga ccaatatcct tgatgaacat tgatgcaaaa atcctcaata 73380 

aaatactggc aaactgaatc cagcagcaca tcaaaaagct tatccaccat gatcaagtgg 73440 

gcttcatccc tgggatgcaa ggctggttca atatatacaa atcaataaat gtaatccagc 73500 

15 

atataaacag agccaaagac aaaaaccaca tgattatctc aatagatgca gaaaaagcct 73560 

ttgacaaaat tcaacaaccc ttcatgctaa aaactctcaa taaattaggt attgatagga 73620 

20 cgtatttcaa aataataaga gctatctatg acaaacccac agccaatatc atactgaatg 73680 

ggcaaaaact ggaagcattc cctttgaaaa ctggcacaag acagggatgc cctctctcac 73740 

cactcctatt caacatagtg ttggaagttc tggccagggc aattaggcag gagaaggaca 73 80 0 

25 

taaagggtat tcaattagga aaagaggaag tcaaattgtc cctgtttgca gatgacatga 73860 

ttgtatatct agaaaacccc attgtctcag cccaaaatct ccttaagctg atcagcaact 73920 

30 tcagcaaagt ctcaggatac aaaatcaatg tacaaaaatc acaagcattc ttatacacca 73980 

acaacagaca aacagagagc caaatcatga gtgaactccc attcacaatt gcttcaaaga 74040 

gaataaaata cctaggaatc caacttacaa gggatgtgaa ggacctcttc aaggagaact 74100 

35 

acaaaccact gctcaaggaa ataaaagagg atacaaacaa atggaagaac attccatgct 74160 

catgggtagg aagaatcaat atcgtgaaaa tggccatact gcccaaggta atttacagat 74220 

40 tcaatgccat ccccatcaag ctaccaatga ctttcttcac agaattggaa aaaactattt 74280 

taaagttcat atgggaccaa aaaagagccc acatcgccaa gtcaatccga agccaaaaga 74340 

acaaaactgg aggcatcaca ctacctgact tcaaactata ctacaaggct acagtcacca 74400 

45 

aaacagcatg gtactggtac caaaacagag atatagatca atggaacaga acagagccct 74460 

cagaaataac gccgcatatc tacaactatc tgatctttga caaacctgag aaaaacaagc 74520 

50 aatggggaaa ggattcccta tttaataaat ggtgctggga aaactggcta gccatatgta 74580 

gaaagctgaa actggatccc ttccttacac cttatacaaa aatcaattca agatggatta 74640 

aagacttaaa cgttagacct aaaaccataa aaaccctaga agaaaaccta ggcaatacca 74700 

55 

ttcaggacat aggcatgggc aaggacttca tgtctaaaac accaaaagca atggcaacaa 74760 

aagacaaaat tgacaaatgg gatctaatta aactaaagag cttctgcaca gcaaaagaaa 74820 

60 tcaccatcag agtgaacagg caagctacaa aatgggagaa aatttttgca acctactcat 74880 
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ctgacaaagg 


gctaatatct 


agaatctaca 


caaacaaccc 


catcaaaaag 


tgggtgaagg 


tttatgcagc 


caaaaaacac 


atgaaaaaat 


aaatcaaaac 


cacaatgaga 


taccatctca 


caggaaacaa 


caggtgctgg 


agaggatgtg 


gggactgtaa 


actagttcaa 


ccattgtgga 


aactggaaat 


accatttgac 


ccagccatcc 


aatcatgctg 


ctataaagac 


acacgcacac 


caaagacttg 


gaaccaaccc 


aaatgtccaa 


atatatacac 


catggaatac 


tatgcagcca 


ggacatggat 


gaaattggaa 


atcatcattc 


aacaccgcat 


attctcactc 


ataggtggga 



25 aggggaatat cacactctgg gaactgttat 
cgggagatat acctaatgct agatgacgag 
tgtatacata tgtaactaac ctgcacaatg 

30 

taaaaaaaaa gaaaaagaaa aaggccatta 
aagccaatag ttttccaagc tgacttagaa 
35 tatttgtcag cataacctac cttctcttat 

taaaaatgtc aagacttgaa aaaaaaaaag 
ggaatataaa ataaaagaaa gtgaactaga 

40 

ataaaaatag aaaattaaaa aatttaaaat 
gtttaaaaaa caaaacacaa ctgcattctg 
45 caaatagatt taaggcaaga agcattgcta 

agggtcgatt cacaagagaa atattacatt 
aaagccaaaa ggaaaagcta actcagctga 

50 

aaattttagc acacatttct cagcaactga 
gaattttgaa tacaattgat aaaattgacc 
55 caaataacta caattataat actgtcaact 

gtatttatca tgtataacat attttgaagt 
agctaattaa caaaggcatt acctcccata 

60 

attcactatc tttgcacttt taagaataca 
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atgaactcaa 


acaaatttac 


aagaaaaaaa 


74940 


acatgaacag 


acacttctta 


aaagaagaca 


75000 


gctcaccatc 


actggccatc 


agagaaatgc 


75060 


caccagttag 


aatggcaatc 


attaaaaagt 


75120 


gagaaatagg 


aacactttta 


cactgttggt 


75180 


agtcagtgtg 


gcaattcctc 


agggatctag 


75240 


cattactggg 


tatataccca 


aaggactata 


75300 


gtatgtttat 


tgcggcatta 


ttcacaatag 


75360 


caatgatagg 


ctggattaag 


aaaatgtggc 


75420 


taaaaaatga 


tgagttcatg 


tcctttgtag 


75480 


tcagtaatct 


atcgcaagaa 


caaaaaacca 


75540 


attgaacaat 


gagatcacat 


ggacacagga 


75600 


ggggtggagg 


gaggggggga 


gggatagcat 


75660 


ttagtgggtg 


cagggcacca 


gcatggcaca 


75720 


tgcacatgta 


ccctaaaact 


taaagtataa 


75780 


aaaaaaaaaa 


gaaaaaaaaa 


gaaaggagat 


75840 


actcgacaaa 


aaaaatgggt 


atatccttac 


75900 


tatctgtcag 


tctgaaatac 


tcctcaataa 


75960 


aaagaaatga 


taaactttaa 


aatacctcag 


76020 


agattttctg 


tgtaaaacta 


ttatgtaccc 


76080 


tttaaaaaaa 


gatggtccca 


gtaaataaga 


76140 


tttacaagaa 


acagctacac 


tagtaacaaa 


76200 


aacgtataat 


aggggacttt 


ctataataaa 


76260 


tctaggtttg 


aatgctccta 


ataacaactc 


76320 


atgagtagag 


aaagctaaaa 


tcatcataag 


76380 


aagaatatgt 


agacagacca 


taaaattaca 


76440 


taaatgtcat 


atatagaatg 


aacactacac 


76500 


gacattttta 


attgacagat 


aaaattatat 


76560 


acatatatgc 


attgtggaat 


gtttacatct 


76620 


gttatcactt 


ttgcagtgag 


aacacttaac 


76680 


atagccatca 


ttctcagcaa 


actaagacag 


76740 
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gaacagaaaa ccaaacacct tatgttctca ctcataagtg agagttgaac aatgagaaga 76800 

catggacaca gggaagggaa tatcacacac caaggcctgt tggggggtag tgggacaagg 76860 

5 

ggagggagag ctttagaaca aatacctaat gcgtgtgggg cttaaaacct cgatgacagg 7692 0 

ttgataagtg cagcaaacca ccattgcaca tgtatacctg tgtaacctgc acgtcctgca 76980 

10 catgtatccc agaacttaaa gtaaaataat aataaaaaaa gaatacaata tactgtcatt 77040 

aactatgggc accccgctgt acaacagatc tcttgaactt attcctccta tctaactaac 77100 

tgatggaagt gataagacaa tagaattaca tctttaaagt gctcaaacaa aataatgcca 77160 

15 

acccagaatt ctaggcctgg tgaaaatgtt cttcaaaaat gaaagtgaaa tagccaggcg 77220 

cagtggctca ggcctgtaat cccaggactt tgggaggcca aggtgggcag atcacttgag 77280 

20 gtcaggagtt tgagaccagg ctggccaaca tggtgaaaca ccatctctac taaaaataca 77340 

aaaattagca gggcatggtg gtgggcagct ttaatcccag ctactcggga agctgaggca 77400 

ggagaatcac ttgaacctgg gaggcggaag ttgaagtgag ccgggactgc accactgcac 77460 

25 

tccagcctgg gcagcagagc aagactccgt ctcaaaaaca acaacaacaa caaaaacaca 7752 0 

atgaaagtga aataaaaaca actgaaaaaa ttgacagatt tcacttagaa cattttcaag 77580 

30 aattagccat gtgctgagcc tacaagcctc aacaaaattc aaagaattga aatcatatac 77640 

agtatattgt ctgaaaacaa tggaattgaa ttaaatgtca attttaaaaa ggcaaaaaaa 77700 

aaatccccaa aactgttttg aaattctaag ttatacaagg ctcaaagaat ttacaacaga 77760 

35 

cataaaaaaa ttaaaaacaa aataatagaa atatttatat aaattgatta caactaaagt 77820 

gtgcttagag gaaactttac gacctcaaag gcaagtattg ggaaggaaaa tagactaaaa 77880 

40 agtaaattat ctaaatattc ataccaagaa attagattaa aaaattaaat ttaaagaaaa 77940 

tgaaagaatt gagatatagt attctataat agacataaca aagaatagag ataaaaacta 78000 

atgaaataca acacaaacca tggataagat aaaataaagc taaaatatgg atgtttgaaa 78060 

45 

aactaataaa attggtaaac ccctagatat ttaaaaagaa aaaagagaaa acacaaacta 78120 

tcattatcag aaatgaaggt gtatcactat caatacaata gccaagaaca tatgaggaaa 78180 

50 ttattaacat aaaaattatt tgatgcatta gatgattgtt tccttataaa acctacctac 78240 

caaaactatg caaaaccacc cgccatgact aacacaggaa aaaatgtata tatctgattg 78300 

ggcttatatc taataaaaaa tcaaatacat aatttaaaac tttctctcaa aaaagaaaaa 78360 

55 

gcaaagcctt gcaggagcag actaatttct tccaaacatt tagaaaagaa acaacaccaa 78420 

actgaatact ttcagttcac ttcccaactc acgagtattc atgtgaggct gtcacaagtt 78480 

60 tgacatgaaa acataaggac attataaaat atttaaaaat tacgggtaac tctatcttgt 78540 
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aaactttgtg aagaaatctg aaataaaata 
ataatatacc gtggccaagt cagtggatat 
5 catcaatcag tgatacacct aaatgtgaaa 

tgcagggaac ttaggagaat atcttcacga 
aaacataaaa gggactaaat ataggggaaa 

10 

aaagggagaa aatttcctct ctctatatat 
aatatattaa gcactcctga aaatcaataa 
15 gcaagcagtt cacaaagaaa gatagataaa 

cattattaat catcagataa atgcaaatta 
ccagaatcat tttaattaaa aggaccacca 

20 

ctggaactct catacccttt tgatggaaat 
aaaactattg ggtggttttt taaaaagtta 
25 agacaaaaat gtttatagca gtcctattca 
tctatcaaca acaaaatggc taattattca 
gaacaaccta ctgacacaga ggagcatgaa 

30 

aatttcagat aaaaaataaa caggcaaatt 
gttgactctc aatagggtgg gaggattgat 
35 tgataaaaat gtgatatatc ttgacgtgga 
tttttttttt gagatgaagt ctcactctgt 
cggcttactg caacctccat ctcccaggtt 

40 

tagctgagat tacaggctca tgctgccatg 
tggggtttca gcatgttgcc caggctggtc 
45 cctcggcctc ccaaagtgct aggattacag 

cttctttata tataaacttt gatatagtta 
acagcaacat tttcatgtgt ttatatacag 

50 

aaattttctt tttaaaagat aaagaatatg 
tggccaaaca tacaataatg ctccaattca 
55 acaatgaact aatatttaca tcactagact 

tggtatttat cactggtgaa aacaatgttc 
cactgccatt ttgggaagca aggatacaac 

60 

ttaacccaga tgtctatctc tacagtaatt 
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ttaacaaaat 


caaattcagc 


ataatatatg 


78600 


cagaagtaca 


caattgattt 


cacattcaaa 


78660 


tcctaaacta 


taaagcttga 


gtgggtgggc 


78720 


ccatggtagt 


agacaaagct 


ttcttagata 


78780 


aataaaataa 


aatggactac 


caaagcacag 


78840 


atctctctct 


cacaaaagac 


ccacatccag 


78900 


taaaaaggcc 


aaaaaaagat 


taagacttga 


78960 


tggctagtga 


gcatatgaga 


acgtgctcaa 


79020 


aaatcacaat 


aagacactac 


tccacacaca 


79080 


ataccgggtg 


ttggaaggaa 


tgtgaagcaa 


79140 


gggaaacgga 


aaatggttca 


accaccttgg 


79200 


atcatacacc 


tactttatgt 


ctcatgatat 


79260 


ttatagttaa 


actgataagg 


aactcaaatt 


79320 


aataatagaa 


tacttctcag 


caatataaaa 


79380 


tgaatttcaa 


tttattatac 


cgaggcaaac 


79440 


taacacatga 


tgctagaaat 


cagaagggtg 


79500 


tgggaaggga 


cacaagagaa 


cctactgggg 


79560 


aagaaaaaga 


gataaaagac 


atttcttctt 


79620 


tgcccaggct 


ggtgtgcagt 


gatgtcatct 


79680 


caagtgattc 


tcctgtctca 


gcctcccgag 


79740 


cccagctaat 


tttttgtatt 


ttagtaaaga 


79800 


tcaaactcct 


gagctcaggc 


aatccaccag 


79860 


gcatgagctg 


gcgcaaccag 


ccaaaataca 


79920 


aaaatggaaa 


atatagtttt 


aaagcagata 


79980 


agtgcatata 


actgacaaga 


agtcaagaaa 


80040 


aggaaagaat 


ttacaaaaag 


gaaactacaa 


80100 


ctaatagtca 


gggaaatgtc 


aattaattta 


80160 


ggcaaaaatt 


aatgagttat 


aatttatggc 


80220 


ttgtatatca 


ggactagaag 


tgtgcaatgt 


80280 


atttacctac 


ctgaatgata 


aacagacctt 


80340 


aaaccacctg 


tatataatgg 


tatatgtaca 


80400 
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acaatgctat tgcagcattg ttcagagggg 
aaagaggaaa tcaatcagtt cctgtcacaa 

5 

atgaaatatt aagtggccat taaaaaccat 
ttaaaatata gtagacaaat aaggcttcag 
10 acctaagtat aaagtaaagc tctcagaaga 

gcagcctctg ggctactgac aagatgtgta 
caaaatattt tgaaggtctt ttgctgtgat 

15 

gtggctcatg cctgtaatcc cagcactttg 
caggagttca gcgctgccaa catggcgaaa 
20 ccgggtttgg tggtgggcac ctgtaatccc 

gcttgaacct gggagttgga ggttgcagtg 
ggctgataag agtgaaactc cgtctcaaaa 

25 

aacggtgaac tactagcgta tgcttcttat 
atagttttct tttttccctc tttttctttt 
30 ccttctttct ttccccttca ttagaccacc 

catttataaa atggactatt tcctacatat 
tctctaaaaa gataatatgc tatgctatgc 

35 

tatgctatcc tatgctatgc tagctatgct 
tgctatgcta tgctatgcta tgctatccta 
40 tcctatgcta tgctatccta tgctatgcta 

tactatgcta tgctatccta tggtatccta 
tgctatgcta tgctatgcta tgctatgcta 

45 

tgcaatgcag tgctatgcta tcctatgctg 
tgctatgcta tgctatccta ttctatgcta 
50 ctgaaaatcc ttcactgatt tgattcctaa 

gcttttatta tctgggcatc actatgtcat 
atcatgactc ttcatttttt tttcttcttc 

55 

tgactttcag cattattgaa ttattcattt 
cagaggattg tgattgtgaa caatgttgtt 
60 aaatattatg tagcaatgac ttctgcttgt 
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taaaacaaac 


aaacaaaaaa 


acacagaaaa 


80460 


aagaataaat 


gaatgatgac 


aaactacacc 


80520 


gtggctctaa 


aatagctctt 


caatctatag 


80580 


gagaggatgt 


taaccatatt 


aacctttgca 


80640 


ctgcaaactt 


taactggtaa 


gtgcaggtct 


80700 


tctctatttg 


tgctcatagt 


tgccaaaata 


80760 


tgaaaaaaac 


acaggtgggg 


ccgggccatg 


8082 0 


ggaggccaag 


gtgggcagat 


cacctgaagt 


80880 


ccccgtctct 


actaaaaaca 


caaaaactag 


80940 


agctactcgg 


gaggctgagg 


caggagaatc 


81000 


agccgagatc 


ctgccattgt 


actccagcct 


81060 


aaaaaaaaaa 


gacaaaaaag 


aaaccaaaaa 


81120 


gtctctttcg 


ctttctcttt 


tactaagcca 


81180 


ctatttttag 


ttctttttcc 


tgtttcttca 


81240 


ttttatatat 


gcatttcttt 


gcattttatg 


81300 


ataaatcaaa 


aatgtatttt 


tatatgcctt 


81360 


tatgctatgc 


tatcccatgc 


tatgctatgc 


81420 


atgctatgtt 


agctatccta 


tgctatccta 


81480 


tgctatccta 


tgttatgcta 


tgctatgcta 


81540 


tgctatccta 


tgctatgcta 


tgctatccta 


81600 


tgctatgcta 


tgctatgcta 


tcctatgctg 


81660 


tgctatgcta 


tgctatgcta 


tgctatgcta 


81720 


tgctgtacta 


tgctatgcta 


tcctaggcta 


81780 


tgctatccta 


tgcttaaata 


gatgcatact 


81840 


gcatcagata 


cacttgagat 


atttggggaa 


81900 


tataatcagg 


aaaaccgtta 


ctgtgccaac 


81960 


ttctattaca 


gggaattttg 


attacattga 


82020 


ctctgccttt 


ctcaattttg 


gggtgccact 


82080 


gactagcact 


gtgagaataa 


agatgtgtta 


82140 


aagggaaagc 


caaaggacac 


atagaatctt 


82200 
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aaggctggga actctttgaa gcaaagggga 
tttgactgat aaagcacatt ctgcctcttg 
5 aatggtcacc ttggtgttgt cacctctgtt 

aatgtggtgt tgcaaagata ggaaccaaag 
cttctctggt tgatgacatc tcctgttgat 
actccagtgg tgtgtggaaa gatgagctgg 
cattaatgct gcaagatcat caatgttaaa 
15 aaaaataatg aaggcaaata ttggtggaac 

attatatagg tgttactcat ttaatctcac 
tacccatttt acagaaaagg aaactgaagc 

20 

ataaagtttg caatggcaga gccatagtat 
cttaattcca atgccaagca aagtttggaa 
25 gcttttgcat tcaatctgtt atgatatctc 
taaacttgtg aaaagagaaa taaggtctta 
agggcaccta caaaggtctc aaggacccca 

30 

taccttatat ggtattgcct ctcactggct 
tgtatcatct cacttgtcct gacagccctc 
35 aaaattgaag ctcaaagtgg ttagcttttc 
gaggatttga gtcaagttcc ctaacttcaa 
gaaaaaaaaa actcagcata ctcaaaaaat 

40 

ctggcagcag cttaggttat aggtgcattt 
aaataagtaa gttttttact tcagtgaagt 
45 aatgatgtac agatgtgtga ggcttgagat 

aagtcaacta ggtttaaatg ttgtcttgaa 
aaaagggaat gcttacacac ttctgggggg 

50 

agcagtttgg caatttctca aagaactcaa 
attattgggt acatacccaa aggaacataa 
55 tatgttcatt gcagcactat ttgcaatagc 

aacagcagac tagataaaga aaatgtggta 
agaaaagaat gagataatgt cctttgcagc 

60 

aagcaaacta acacagcaac tgaaaagaaa 
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gtggctcaag tgaaggggtt ctggggacat 82260 

gcaatgccat ctgcacctct ccccaaatca 8232 0 

agtcatgcat tatttcaact aaaggacagc 82380 

cccaaggcct ggggtgaccc agtcatctaa 82440 

ttcactaaga gactccaccc agtggctcct 82500 

ttggtggaat ggtttggccc ataaattcta 82560 

gage tga tea actagaatac taacaatagt 82 62 0 

cactactata ttccaggcat cattc'taagt 82680 

aaaagtccta ggaggtgaat tgtattatta 8274 0 

acagaggggg taagtaattt gctcaaggtc 82800 

caacaggcag ccggttccag aatccaaggt 82860 

ggaactactt tcctctggat tctcttagca 82 92 0 

gtgttctgta gectatggag tatactactg 82980 

gtattattac aaaaatagtt ttgaacttat 83040 

ggaaatccga gaccacactt tgagaaccac 83100 

gaatacctcc taagtattag aggtctcacc 8316 0 

agggtagata tgtcttttgt catagtcata 83220 

tgagagcaca aaaggagcat gatttagaaa 83280 

atcccattat cctatgttgt ttatggtata 83340 

gaagtaacag gctttcattt ctactatttg 83400 

gtgaaggtga eggatacaac atggcaggaa 834 6 0 

taggttatga tgaaaaatac ttggcaactg 8352 0 

atatttgagg gtgcaaagtc ttgggctttt 83580 

gtcaaaaaat agtagatgea agttgtggag 83 64 0 

aatgtaaatt agttctgcca ctgtaaaaaa 83700 

agcaaaatta ctattcaacc cagcaatccc 83760 

ategttctae cataaacaca tatgeaegta 83 820 

aaaaacgtgg aatcaaccta aatgcccatc 83880 

catatacacc atggaatact acatagecat 83940 

aacatagatg aagctggagg ccagtattct 84000 

atactgeatg tgttctcact tataagtggg 84060 
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agctaaacat tgagtacata tggacacaaa gaagggaaca acaggcactg gggcctactt 8412 0 

9 a 999tggag 9 a tgggagga gggtgaggat caaaaaacta ctattgggta ctatgcttat 84180 

5 

tacctgggtg acaaagtaac ctgtatatca aacccccgtg acatgccact tacctatata 8424 0 

acacatctgc acatgtaccc tgaacctaaa ataaaattta aaaaacaaag aagaaatgtt 84300 

10 aaaattatta gaaataaaga tagaagactt ctttacaaag tcaaggtagt aaagaatttc 84360 

ttaaacagat cccagaacat gaatagtacc agcaagaact gatacaaccc atttagtcgt 8442 0 

tattaagaat gttacataag tgtgacgaga gcagctacaa atcaaagcct tttagcaaca 84480 

15 

caaataacca acaaaaataa tgggatatag aatatataaa atcctcctat acattaatag 84540 

aaaaatacaa cttgatgaaa acttagtcca aagccatgca cagccatttc acaggagaca 84600 

20 gaaacaaagg ttaatatatg aaaagcactc aatttcttaa gcaattgtgg aatgaaagga 84660 

actctcctac cacaatggta tgagtgtaaa tgaagacaac cactttaaaa actattccct 84720 

agtaaaggta aagatacaca tactttataa cccaacaatt ccactgctag ttaatatacg 84780 

25 

caagagaaat ccttgtacat gtgcaacata tgacacgtac gaatgtttcc agagcaggat 84 84 0 

tgcctgcaat acctcaaatg tccatggaca gagaacagat caattgtacc atgttcccta 84 90 0 

30 ctggaggacc acatattggt aaaaataaat tacttgaagc aaatcaacaa agatgaatta 84960 

tacacattta atgttgagta aagaaagcaa gtcacaaaag aacatcttat taattctacc 85020 

taaataatga ttagaaaaca caaaaactgc tctctccctc tccctctccc tctccccgca 85080 

35 

gtctccctct catgctgagc cgaagctgga ctgtactgct gccatctcgg cttactgcaa 85140 

cctccctgcc tgattctcct gactcagcct gccgagtgcc tgcgattgca ggctcgcgcc 85200 

40 gccacgcctg actggttttg gtggagacgg ggtttcgctg tgttggccag gccggtctcc 85260 

agcccctaac cgcaagtgat ccgccagcct cggcctcccg aggtgccggg attgcagacg 85320 

gaatctcgtt cactcagtgc tcaatggtgc ccaggctgga gtgcagtggc gtgatctcgg 85380 

45 

ctcgctacaa cctccacctc ccagccgcct gccttggcct cccgaagtgc cgagattgca 85440 

gcctctgccc ggctgccatc ccgtctggga agtgaggagc gtctctgcct ggccgcccat 85500 

50 cgtctgggat gtgaggagcc cctctgcctg gctgcccagt ctggaaagtg aggagcgtct 85560 

ccgcccggac gccatcccat ctaggaagtg aggagcacct cttcccagcc gccatcacat 85620 

ctaggaagtg aggagcgtct ctgcccggcc gcccatcgtc tgagatgtgg gagcgcctct 85680 

55 

gccccgccga cccgtctggg atgtgaggag cacctctgcc cggccgcgac cccgtctggg 85740 

tggtgaggag catctctgcc cggccgcccc atctgagaag tgaggagccc ctccgcccgg 85800 

60 cagccgcccc gtctgagaag tgaggagcct ctccgcccgg cagccacccc gtctgggaag 85860 
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10 



tgaggagcgt 


ctcccggcag 


ccaccccgtc 


ccaggccagc 


cgccccatcc 


gggagggagg 


acccgtccgg 


gaggtgaggg 


gcgcctctgc 


cctctgcctg 


gcatcacccc 


gtctgggagg 


tgatgacgat 


ggcggttttg 


tggaatagaa 


atcagattgt 


tgctgtgtct 


gtgtagaaag 


gtactaagaa 


aaattcttct 


gccttgggat 


tatactctct 


gaaacatgtg 


ctgtgtccac 


gatgtgcttt 


gttaaacaga 


tgcttgaagg 


cctaatctca 


agtacccagg 


gacaaaaaca 



20 

gaaagccaga gacctttgtt cacttgttta 
tgaccctgcc aaatccccct ctgtgagaaa 
25 gagactataa acataatttg agtgtttaga 
taaataaata aaaataaaat taaaaaaatg 
aaagctggag aaaataaata ctaaagtgtt 

30 

aattagagga ggatgtgaaa gcacagcgta 
ttgaggtgaa agagaaataa ccataaaagg 
35 attggtttgg agaacggaaa taaaatctac 

gcacacaggt tggagcagag aaagaggaaa 
cattgatacg ttcttgccta tctctcctcc 

40 

acctgtgtgg gttctgttct ctagattgga 
gaggccagtt gaaaaatgag ggagaataaa 
45 agaagaggaa tgaaaaagta agttggttgc 

aaaaaaaaag ctaatgctcg ctgggtgcgg 
gaggccgagg caggagaatc acctgaggtc 

50 

tgaaacccca tctccactaa aaatacaaaa 
atcccagcta ctcggggggc tgaggcagga 
55 cagtgagccg agatcgctcc attgcactcc 
taaaaaaaaa aaaaaaaaaa gctaatgcta 
ccattccatg aacatgggcc caaaaggacg 

60 

atgtggctac cacagggaca acatagaaca 
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cgggagggag 


gtgggggggt 


cagtcccccg 


85920 


tggggggtca 


gacccccgcc 


cggacagccg 


85980 


ccggccgctc 


ctactgggaa 


gtgaggagcc 


86040 


tgtacccaac 


agctcattga 


gaacgggcca 


86100 


aagggggaaa 


ggtggggaaa 


agatagagaa 


86160 


aagtagacat 


gggagatttc 


attttgttct 


86220 


gctgttgatc 


tatgacctta 


cccccaaccc 


86280 


tcagggttca 


atggattaag 


ggcggtgcaa 


86340 


cagcatgctc 


gttaagagtc 


atcaccactc 


86400 


ctgcggaagg 


ccgcagggtc 


ctctgcctag 


86460 


tctgctgacc 


ttcccttcac 


tattgtccta 


86520 


cacccaagaa 


tgatcaataa 


aaaaaaaaaa 


86580 


aatttaaaaa 


taaaatccca 


gtaaaaaaaa 


86640 


cagtatgata 


ttgcaatgat 


tgagcattag 


86700 


tgggagggtc 


acaataggct 


tcccagtaaa 


86760 


ttccaaactt 


tacagttgca 


tttacatctc 


86820 


actctggaac 


tacataaaga 


tacaagacaa 


86880 


ctataaaatc 


caggtaaata 


tctgatactg 


86940 


catagaggtg 


ccaaaggaac 


aaagtaagtc 


87000 


aaatcaatgg 


gcacaaactg 


tggctggtct 


87060 


gggatgaaga 


caagttcttg 


actctatgtt 


87120 


accatgaacg 


aaacaagaaa 


gaaacaaaac 


87180 


tgtacaagag 


ataaaacgga 


ctgcaaaaag 


87240 


tggctcacac 


ctgtaaaccc 


agctctttgg 


87300 


aggagttcga 


gaccagcctg 


cctaacatgg 


87360 


attagctggg 


cgtagtggca 


ggtgcctgta 


87420 


gaatcacgtg 


aactcgggag 


gcggaggttg 


87480 


agcctgggtg 


acaagagtga 


aactccatct 


87540 


agaaaggtga 


ataataggtt 


caactgtatg 


87600 


aggccatgcc 


ccactgggca 


cttgatgatt 


87660 


cagaagtaat 


actggagccc 


ttggctttgt 


87720 
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aacacatttt aattaaggta cttctttctg 
gggaaagaac attcattttc accagatcag 

5 

ctgtggatgt actgggtgat tttgtgaaaa 
taaagtactc ttgaaaataa taatcatctt 

10 atgaggtaga aagaaactat aaagaagtaa 

atcacagagc aaattgctga ggttttctct 
tcagagctcc aatattttta tgtgttcctt 
tcttattttt aaaaaaagga caactagggg 
tcctactaag tcaataggtg gaatctataa 

20 tgtatcattt acagaattgt cttcatttca 
gctctgtctt taattccttt tataaaacca 
tcctaataac tatagaccta gccatgaata 

25 

tagtccagaa actaaacagc agatatgaaa 
attgagtttt cctcataaaa tttctcgaga 
30 agaatgtaat gaaatcatat acatgcagaa 

taaaagctca aaaaaattct tgttctggtt 
tttctttgct cttgaaattg accctgttca 

35 

atccactgcc taaaacaaga tagaatacct 
tttaagctct tgccatgtgc taggcactgt 
40 tgaactttag actagcctcc ttaggtacct 

taacagggat ttcattaaga aacttgctta 
aagattagaa aagagggctt gagcaatatt 

45 " ~ 

taacctcaca ttctcaaatg ctattgagag 
atggacttgt ttcctttttc cttgtttaaa 
50 attttttttt ttttggagat ggagtctcgc 

gatctcggct cactgcaagc atccacctcc 
tccaagtaac tgggaataca gggggacacc 

55 

tgtattttta gtagagacgg ggtttcacca 
ctcgggatct gcccgcctcg gcctcctaaa 
60 aaattcctct gtaatcagat atttcagaaa 
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acaagaaaac 


aattaaaatt 


aaaataggta 


87780 


cattttgtga 


aaggtgatat 


aagtgtatac 


87840 


tgattcggtt 


tcatgttttg 


ctttatgcta 


87900 


cactctgtca 


aaagtgaagt 


tagcatgaaa 


87960 


atttttccca 


tttttatttg 


aaataatgta 


88020 


aattgcttta 


gtcttatact 


tttgatctca 


88080 


gttcttaaaa 


ataattgcag 


ttgttctaat 


88140 


taaatatttt 


gtcaactgtc 


aagagaacaa 


88200 


aaggagtcta 


tttattagtt 


tcccttctca 


88260 


attgaataat 


taaagaatca 


gaactggctg 


88320 


gaacttgacc 


cagtaaaaat 


gtattctgag 


88380 


atctgggttc 


tatgtgaaat 


ttacagatct 


88440 


ggagaaactg 


ttgataaata 


aatgaaaagt 


88500 


caaaaaaaaa 


gcaatgatca 


acagcaaagg 


88560 


acgtggacaa 


ttctttaatt 


gtatgctttt 


88620 


aattactgta 


ctctgagtac 


ccatctccta 


88680 


attcctttta 


taaacttttc 


ctatggccac 


88740 


tgggatctaa 


taataaaata 


atagttcctt 


88800 


tccaacagtt 


ttgcataaat 


taggataatt 


88860 


tctgtcattt 


tacagttgag 


acataggaac 


88920 


aagtcccaga 


cttagtgaag 


tagcagattg 


88980 


tggccaccga 


aaccagatcc 


ataggatgta 


89040 


ttacgtggtg 


ttactccaag 


agtcacagct 


89100 


ataaactttt 


gtgggctcaa 


atgtgtttac 


89160 


tctgtcaccc 


aggtgggagt 


gcagtggcac 


89220 


cggggtcaca 


ccattctcct 


gcctcaacct 


89280 


cctacacccg 


gggaaatttt 


tttttttttt 


89340 


tgttagccag 


gatggtctcg 


atctcctgac 


89400 


gtgctggatg 


tgtttccatt 


ttgaaaaaaa 


89460 


gatgttttct 


cccttttcat 


tgagttttca 


89520 
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accattttca gttgcatcta gtatcggagg 
aattctctca ttgtgaaatg gggattttag 
5 gggtacagag ataatgtatc tcatgcactt 

taatcggcac aattccacat actttgcaga 
ctaaggaagg tagtcttatt agctctattt 

10 

taacaaactt gccaaagaca caaagttgga 
gacactatac agaccctact gtgcctataa 
15 cacctcacct aaagcttgct aaagagaatc 

aatccggttt ataaactcct ctccatggca 
ttgcaggagc agagtcaata ccctgtcacc 

20 

tccagagtga tgctgcatgg agggggattc 
gtcctaactc tgccacatgt tttctgtgta 
25 tcaatttctc atctgtaaat caaggggaat 
accctaaaat ttctggattc tatgagcaca 
agtgatacat tgtaccaata ggttattcat 

30 

attcatcact ctaactttct t gat at teat 
acctcaaaaa aaggaatcta aggctaaaga 
35 agaattaata gtcattgttg tccatattca 

ccttgcctac atggggcctg agatgetcta 
catttttcct aaagaacaga agaatcatct 

40 

ccaattactg gcaaacaatt ttaatcttgg 
tacttggccc ctgtattagt tttctatttg 
45 gcttaaaaca atataatttt tttatctgac 

tcactagact aagttcaagg tgtcagcaga 
gaatctgttt ccttgtttca tccagcttct 

50 

cgccttcctc caacttcaca gttggcagca 
gatgaaacta ttccacctca ggtcatcagg 
55 agatccccaa catgtgcagt tcacagtagg 

gctggtctga taggaggcag agctcaggca 
gtgcagccag gtttccaaca gaccacagac 

60 

acttatgaga atcatgtttc tatcagttgt 
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gtaagggtta 


tgagtacagg 


ttctggatta 


89580 


tagagtacct 


atctcatggg 


gttctgtaaa 


89640 


aacatagaaa 


taataataga 


gccaaccctg 


89700 


gaataactct 


ttaatactca 


aactgactct 


89760 


tacagacaga 


aaacaaaggc 


aaagagaggt 


89820 


gttagatccc 


aagttcatgt 


tcctaaacat 


89880 


taagtcttta 


ataaatgtta 


gctctctata 


89940 


cttactaact 


cactcaatca 


caaagaagtt 


90000 


tcagaagaca 


gaagttttgt 


tgttgggtag 


90060 


aaggctggcc 


tggaaaacaa 


getgeaaatg 


90120 


tgaaccaaca 


gecaagaate 


ccattttcaa 


90180 


accttgagca 


tgtcacttaa 


cgactcagct 


90240 


gagacaggat 


aaatggaaag 


ggtcttttct 


90300 


gggtctgeat 


atgtgtcctt 


gttagatacc 


90360 


agtattcata 


cttcatacca 


agaatgagac 


90420 


aagcacaaga 


aaattgttgt 


gcttttgaag 


90480 


caaatttttt 


aagtgtcata 


atttttttca 


90540 


gttgtactgg 


cttatttcca 


acaaaatttc 


90600 


agctttcaga 


aggacagaaa 


tgaataaaag 


90660 


taagtgggca 


tatgtgaact 


atctagtttc 


90720 


actgccctcc 


ttgecataca 


agtattgtat 


90780 


ttgatataac 


aaactaccac 


aaatgtagtg 


90840 


agttctgtag 


gtcagaattc 


caataccagc 


90900 


gtcatgttcc 


ttcctggaga 


ctctagggaa 


90960 


agaggegace 


catacacctt 


ggttcatgga 


91020 


tagcatctct 


ctaatgaggg 


atggtttcag 


91080 


aattagagtc 


tcataaggag 


catgcaacct 


91140 


gttcatgctg 


ctatgagaat 


ccaatgccat 


91200 


gtaatgcttg 


cctgctgctt 


acctcatgta 


91260 


aggtaccagt 


ccttggcgtg 


ggggttgggg 


91320 


tatggeccat 


tatctgetea 


cttgttctct 


91380 
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ccatttgcat 


agataattga 


catttgacat 


5 


gataaacgta 


tcacttttgt 


atgttttgtg 


caacaagcca 


tgctgaagta 


aatgaaacca 




tggctcctgg 


atttcaacag 


cctctgggtt 


10 


ctcagcgtgc 


tcagccctac 


ggcatcacat 




gtcaaggaaa 


tatacaaatg 


ataaatccaa 


15 


aagaagcaaa 


ggcactaggg 


gtaagtctat 




aggtcttctt 


ataagttata 


ggagagtatc 




tctgaaatta 


tctcttttag 


acagaaatat 


20 


cat tt cat tt 


tagatcaagc 


aaaaatataa 




aatttggtaa 


tattgaegtg 


acagaagcac 


25 


aaggactttt 


tggacaatct 


caatcccatc 




taaaggtgaa 


tttacataac 


aatgaacaat 




atgggcagtc 


tgggaaaggc 


tgagtgtgaa 


30 


tcttt caeca 


atacccttag 


agatgetaca 




gactctgaac 


tgactgeett 


gtttctgtag 


35 


aagaaaattc 


cacctgccta 


ggtagaattg 




tcattcatcc 


cgctagcaca 


attccaccaa 




caaaatgetc 


tgctggagca 


ttccgtaaat 


40 


agecatctta 


ctatgtattt 


caagttgtga 




tctcctgcag 


cagectattt 


actaagctgt 


45 


agatagagga 


aattccctgc 


tcctaataag 


cagccccaca 


caggtgcttg 


gaatatcatt 




acagaactta 


cgccctagag 


ggggaaatca 


50 


ttagtgtgtt 


agaaggtgtc 


gaatgatata 




aagtactagg 


cagtgaggag 


gcagcaattt 


55 


catgacattt 


gagcaaaaac 


ctgaagttag 




aaaacaacca 


gtgtaaaagt 


ctttagatga 




aggaggecaa 


taaaatagaa 


cagaatgagt 


60 


tgagtggcag 


aggecacagg 


tggtgaggcc 



ttggtaagat tttctttagt ttcttttctg 91440 

atacttagga cataatgatg tcatccaagc 91500 

tacccaaccc ttacccacca ggcagcttta 91560 

caatcaactt agaaaaccaa gctcagggtg 91620 

etcegggaat etttgetage agtcaacegg 9168 0 

gtgtgggaac agcagtaatg aactttaaag 91740 

ttactaccag aattttaatt teacattgea 9180 0 

atecaattge taaaaagtt.c tgaacgttat 91860 

ttagtctgee tatagacttt cccttgagtt 91920 

caagacacag aggagcctcc ctagctggag 91980 

tgttacatct agcttgatct catttcttca 92040 

tctggattag ttacaagtct ggtttttgca 92100 

gatggctgee cccagaaaag gaaactacaa 92160 

agaattgggg gacattccat ctgctatgcc 9222 0 

gtttaactgt caccttactt ctacaaagga 92280 

gcattgtaaa aacaaaaatg ccctacttgt 92340 

ttagtccctt ctccgtgctt tccaaataca 92400 

attatattaa gtctctccct gtgaatagct 92460 

gaatgetcat tatgtgccag aaactcttct 92520 

tgttttaggg aagtaaaaat gagcataagt 9258 0 

ggaggtggcc cttctgatcc tcctcctccc 92 64 0 

ccaatgggac aaactaacac cccatacagc 92 70 0 

aagcacaaag gaggtgcaat ctctgccctc 92760 

gactgtatac ataataagga agtaaagtac 92 82 0 

gacaaaacaa agaagtgtaa aagggctaaa 92880 

caaggtggtc agaatgggct ttccttagaa 92 94 0 

gagtgagece tgcagatatt agggaaagag 93 00 0 

aaatgtcttt gaggtatttg aagaaccggg 93060 

gagggagggg gcagttgtca ggagggcaga 9312 0 

tgcagagcag aaggctttgg ctttgactgg 9318 0 
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gtggatgagg aaccactgaa gcatgctgcc 
tttaaaagaa tgtgaagttt tgttttcaaa 
5 tctgaagatt tgttatttct atgccatcag 

ccacaggtgg caggacttgg cacacctgta 
tggccaaaaa tgcagacaaa ttgtatgtcc 
tgatcagtag atggtacagc cagggctgta 
gatctaactg cccctcatca aggtgacagc 
15 cggctgaatt gctgctctaa agaacagttg 

gttctttaag tgagataggc taaacgtggc 
tatggtatgt gtgtgtatga atgtgcaaga 

20 

tttattgatg tctcaagaaa cttttagcat 
taattcgatt tacttgtgtt cagtagctat 
25 tatttgcagt ctccctggat agataagatg 

acaatcaagt tttggaccaa tctagaaatg 
aatgaaggaa ggttcaataa gaggtacaag 

30 

aatatatgat tgataagaag aatgattggt 
aaaaaaggac aatatacact ttgggaggcc 
35 cagaccttcc tggctaaccc gtgaaacccc 

ccaggcatgg tggcgggagc ctgtagtcct 
gtgtgaaccc gggaggcgga gcttgcagtg 

40 

gggcaataga gcaagactct gtctcaaaaa 
tataaatgaa tccaaaatca agtaataagg 
45 ttcagaaaga aagaaaaaag attaatttgg 

gagcagagct gagcatgggg gaagacagta 
tgtgctgcag taaattccca gatctgcctc 

50 

atctttatgt ctctgggact caatccagtt 
aagtgttcag aacatgacca agcacataat 
55 aattccttta cccgtaactg ggatgaccca 

ctactgacat cacagttaaa gtattgtttt 
gctaatgcat gattaagcag ctgtactttc 

60 

attctccaaa tacctcaaaa cagggcgttt 
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cggcaccgaa 


gaggcgtgtg 


ataaaaatac 


93240 


atctccagtt 


ggcagaagct 


acttcctctc 


93300 


caggccctgg 


gcatcatgac 


agtgcagggg 


93360 


ccagtcaact 


cagaccttaa 


agtgcttctg 


93420 


aaatttttaa 


gtctgtccta 


taggcaattt 


93480 

i 


ggaggagaaa 


tgagggccag 


tacaaatcca 


93540 


aaggatgaga 


atgggaagtt 


tgtgaccaga 


93600 


taaaaaccaa 


atataaagtc 


taatacagga 


93660 


ccatcatctc 


tctgaagcta 


taagcaaaag 


93720 


ataatttttc 


agggagaaag 


gttcatggtt 


93780 


tgtttatgaa 


actgggtttt 


tttctaattt 


93840 


ttattgaaga 


aataaataca 


gcagtctttg 


93900 


taaccatggg 


gaaaaataaa 


ttacaaatct 


93960 


attattcctc 


agaaatctat 


ctttatttac 


94020 


gaggtaaaag 


atgtatacct 


tacttttagt 


94080 


aagaaaatga 


ttgcatacct 


gcaacttttt 


94140 


aaggcgggca 


gatcaagagg 


tcagaagatc 


94200 


gtctctacta 


aaaatacaaa 


aagaaattag 


94260 


agctactcgg 


gagggtgagg 


caggagaatg 


94320 


agccgagatc 


atgccactgc 


actccagcct 


94380 


aaaaccaaaa 


aaaaaccaaa 


aaaaaaacaa 


94440 


tgtgttttat 


gtcatgagat 


gcaactgaaa 


94500 


tgtaatcaag 


aagacttcat 


ggaaaaggta 


94560 


ccgaataatt 


cagacaggct 


cagcagtcaa 


94620 


ttaggaattg 


tgtgaccttg 


gatacctgat 


94680 


gtaggaggat 


tacatgaaac 


aataaatgta 


94740 


tcatgcctaa 


taaatggttt 


tatcttcacc 


94800 


agcagagaac 


tgatccatct 


gaaaaagcag 


94860 


tctaatagtc 


ttttagaggc 


cctcagagat 


94920 


tgaagaattc 


ttcttgggcc 


accccagtaa 


94980 


cactaagggc 


cttgcactcc 


agttattcta 


95040 
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gtatatgtca gaatcttcat gacccacctc 
atggatctca gcaaaattga caataaataa 

5 

acgtctgaaa gtgaagaagt aattttaaat 
catggttgga ttgatgcaca ttggttttgg 
10 tagagaagta ttaggttttg cctctactgc 

cctttctgtg agtagattgc tagaacacca 
tggtttataa agtattgcta tctccagcca 

15 

catctgaggg ctggagaatt tagaaaaatt 
gaaaaattaa cagctctgaa tgagcatgaa 
20 aggagcttat tccattcttt aaaatttaat 
cctactacat ttgagatgga aatgtatgta 
gattggaaac atgattaaaa atatgcaagt 

25 

cccagctact tgggaggctg aggccaggga 
ctgggcaata tagtgagact caatctctaa 
30 attatgagaa gggatggagc agtaagcctc 
cttcataggc cagaaattaa ggaaaagaaa 
gagttactcc agaaaattca gaagtgcgag 

35 

ataaatagac caacctttca tataaagaga 
cacaaataaa tttgtccatt tccacagttt 
40 tccaaggagc tttcccgttg tctggtaagt 

aagattagct taacatattg gggaggaaaa 
ggagcacctt tcccaaaaaa aatctattgg 

45 

aaatggtcca cacagtttgc cactaagaaa 
tcacaaaggt gagaagtagt aaaaatcagg 
50 ggttcagtct ggtcacactc tattatggga 

tacatactag acacatcagt taccaagtct 
tcttgtattt ttattttttt atttgtgcta 

55 

acaattcttt aaattgccat taaatgagtc 
atgatgagta agacaggact ttgagctctg 
60 cgtggtctac tattgaacca ggaaaaagaa 



PCT/US01/48437 



ctgcttcaga 


tgtgtgcctg 


aatgacaacg 


95100 


gcaggggcag 


caatgacagt 


aatgtttcac 


95160 


ggctgttttc 


ttttttcagg 


tgatccagat 


95220 


aattgttttg 


tgtttaatat 


ccttctcttt 


95280 


tgttattggt 


ggatacccat 


tctggggtgg 


95340 


gtccttcttg 


gtttataaag 


tattccctct 


95400 


attcacaact 


catttcttga 


taaagcccta 


95460 


aacagctctg 


aatgagggct 


ggagaattta 


95520 


aaaatcggga 


actctttcgc 


cattttacct 


95580 


catgtaatca 


gcaagccaaa 


gttttgatat 


95640 


ttgggtggag 


agagaaagaa 


aagaaaaggt 


95700 


aagccaggca 


tagtggtgtg 


tgattgtagt 


95760 


attgcttgag 


ttcaagagat 


ggaggccagc 


95820 


aaaaattttt 


taattaaaaa 


aatgcaattg 


95880 


agaacaagga 


atgcttaaac 


attgtgagtt 


95940 


ataaaaaaga 


agtcaaggac 


atatgactga 


96000 


agaaaatttt 


gagaaggaaa 


ctagatgaac 


96060 


aattcccaag 


cacaaccagt 


taacccattt 


96120 


attatctctg 


gctctctctc 


tgtgtcagca 


96180 


tagactgtct 


ctactttttg 


aaccccttta 


96240 


ttgaaaagct 


ggatttggga 


aatgcaaaag 


96300 


ttgccttatg 


gtctgagcgc 


tggggtgaga 


96360 


tggcagagct 


ccatatcctg 


actcttgaat 


96420 


gttgtttaag 


ctttaaccta 


aaagctccct 


96480 


agtccaaagg 


agaatccagg 


agccaggaaa 


96540 


ctactgttca 


tttgccactt 


ctcagctctg 


96600 


tttcctactc 


ttacacaggt 


tgtgtgtttt 


96660 


tagcttgttc 


tgtaagaaga 


gtttataaaa 


96720 


tcggtcacag 


caccccacct 


attccaattt 


96780 


agtgatttat 


caagaactag 


agtcaactag 


96840 
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agcagaggtg 


gtgtcaagat 


tatatgtaat 




gaacacagaa 


tcatgtagct 


catgctcttg 


5 


aattaaaata 


aaataagcct 


aattctgatt 




taaagcaccc 


tttctctgca 


tcttttaagg 


10 


atagattcca 


cctcttagag 


gagacagcca 




agcttttaga 


gaacttcagg 


aggctctaag 




actcgactgt 


gccaccccta 


gcaagtcacc 


15 


catgttaaaa 


tcctattgta 


gaaactgggg 




ccacaaaagc 


ctaaattttc 


tccttctaaa 


20 


acttagggtt 


ttcattaatt 


cctaaatttt 




ccatctataa 


aatcatatgc 


tatcgtacta 




atttaaataa 


tttaacacag 


tgtctgacac 


25 


cattttttta 


taaagaaaat 


atagttgtta 




agatgagtgc 


ctgctttata 


tgatttacat 


30 


tctaggctgc 


atacagaata 


atgtttatgg 




atttcaactt 


gaaaaattgt 


tttgaaagtt 




ttcttatctc 


acttaacttc 


agtgatcctg 


35 


ttaggtgaaa 


ggcagcctgg 


gaatgaacat 




gattctgctg 


ctggtggata 


tgtgcatcaa 


40 


ggtaagtatc 


ccatactcca 


ccatgtgcct 




atgttttatt 


tcttaatacc 


gtataccagc 




cccaccccta 


gtcatatgat 


ctctaggact 


45 


gatagaccga 


ggcttaaagg 


acatccttta 




aatggggaag 


ggtttggata 


ggcctccatg 


50 


caacatttga 


aattctatat 


aaataaggta 




tcagagtaaa 


gtatgaagtg 


agagggaaat 




agaaacccag 


gaatgaagca 


attgctaaat 


55 


acaatcacac 


attcattagc 


cttatatagc 




gggtggccca 

Zi Z) Zj zj zj 


agggcaggaa 


tgcttaccat 


60 


tgaggttggg 


ctggaagctg 


tcatcctacc 




attctgtgag 


caatggaaaa 


ccacagaaga 
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ccagtacaag 


aaagaagaac 


ggggaaagag 


96900 


cagacaaaga 


agaataaaat 


aattaaatca 


96960 


aaccactcct 


gttgcagtcc 


aaaaacaaaa 


97020 


atccttccgt 


ctaattgaga 


ttgtcgaagc 


97080 


ctgacacata 


cctaagacaa 


tttttctaag 


97140 


gcacgctgca 


gcacaagcac 


attctaaaga 


97200 


tgcggacttg 


gtttcacaat 


tattcttctg 


97260 


aaagttttgt 


agtcgcccat 


acacctctgc 


97320 


tttgggggtt 


catatttagt 


ccccagagag 


97380 


tcagcaaaca 


tgttgtgtct 


aaatactttc 


97440 


cctcattcaa 


aggctgtaag 


agttgagata 


97500 


agaaaaatgc 


tcattagaag 


tttgtttttt 


97560 


ttgtgcatga 


agcttttagt 


aaaatccaat 


97620 


ttttctacag 


ctttttagaa 


gtaccctaca 


97680 


taattaccca 


gaagggcaga 


aaacagaata 


97740 


caaaaattaa 


gaaccatgaa 


atttgattga 


97800 


tatataattg 


attttttcct 


gccatctcca 


97860 


tgttagttct 


atcttggcct 


tcattggagt 


97920 


tggggtagct 


ggccaagact 


actgggccgt 


97980 


gctctatttt 


cagtcccgta 


aagtgggtct 


98040 


tctagtggaa 


aatgtttttt 


ctattctagc 


98100 


atagaataaa 


gtgagagggt 


tacctctaca 


98160 


ttggcccatc 


tcactgttaa 


tcacatgaac 


98220 


gcttctaaaa 


ttgtccaaaa 


attattttcc 


98280 


gtcaagctat 


tgagtgtttt 


ttgttaaact 


98340 


gacagggata 


agaacatctg 


atttttacag 


98400 


gaagcacgta 


tctctgaatg 


tgctcaaaac 


98460 


caagcatgga 


ctctgacctt 


aagaggaaat 


98520 


ctataaggaa 


tggagcaaca 


tggtataaaa 


98580 


tcaatggcca 


cattaaccat 


tttggtcttc 


98640 


gtggttttat 


tggtttcatt 


ttattttatg 


98700 
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ttggggttgg ttttgctttt gcttttcagt ttacaaaaag cagagtttat taaattactg 98760 

tgggctacat atatcttaag aaagacacag aatgacttcc ttatctatct agaacatatt 98820 

5 

aacattaaga caatatcaca atattttgaa caaataacca caataacatt tcctttatca 98880 

gttcactcaa tactgtgtaa gtaattctat ttcactggat ctcggagtgg caggcttatg 98940 

10 aagccatctg cttctcaact aaagggttct gaaatcctga ctccaaccat tggtataatc 99000 

tgaaagttgt ctaggcactc ccatcagatg cgtgtcccct agagtatgtt ctttgaagtg 99060 

tcagtagttt gaattgctgg cacactcttt ttccaccaag ctctgagaca atcctttgtt 9912 0 

15 

aaaaacatag gagagtttta ggccgggaat ttgaatgatc acttttgtgc ttggaaagga 99180 

caacactgtt tgcagtgtaa ggaaaagctt atataagaag gtgtattagt ttgttttcac 99240 

20 actgctgata aagaaatacc caagactgag aagaaaaagg ggtttaatga gcttacagtt 99300 

ccacatggct ggggaggcct cacaatcatg gcagaaggca aggaggagca agtcacatct 99360 

tatatggatg ggggcaggca aaaaaaatag agcttgtgca gggcaactcc tgttttttaa 99420 

25 

aaccatcaga tctcatgaga cccattcact gtcacaagaa cagcatggga aagacccacc 99480 

ccataattca atcatctccc accaggttcc tcccacaaca agtgggaatt atgggagcta 99540 

30 caagatgaga tttgggtggg gacacagagc caaaccgtat caggaggcaa aaatagatga 99600 

gaaagcttct cggtaaataa ggcagagtga agcaagaaca aaattcaccc tccaaaatac 99660 

caaacaaaag aacaaaaact attcaaaaaa aaacagcttt caaactggca tccaagtaga 99720 

35 

gaaacaaatc attctgatgc tactcaaagc tgaaaaaaag cacagctttt agacctggct 99780 

cttgtttccc tacaaacaaa actgaggaaa gaagggaagc caaaaaaaaa tactgagaag 9984 0 

40 tctcactaac actctccagt ctgtagctga gaaggctggg gtggagagca gtcaattctg 9990 0 

tgaaaggata aagcaaatcc tgtatggtac aagaccatga gtagtaccca ggacctcagg 99960 

agacagaaga aagggcaaca ccttcccata ccctgcatgg taccctacta ccagcaggga 100020 

45 ~ ' 

atgagcagaa atgcacagga atgcacagga ataaacctcc agccaaggag caggcctcca 10008 0 

tactaagcat ttagcaggct ctcagaagac agaaaaagtg aagtccatcc tcgctagcaa 10 014 0 

50 gaactgtgct cgaagccagc taagctatct ctctccctgc aacagctata tatgttcaca 100200 

ttcagaaaca tggatataaa acaaccgcta gggctcaagt tgtaatggaa gcaaggaaga 100260 

caaatgggag ccaaataaac agaagtgtca atgaaggaaa attcacctac actgtacctg 10032 0 

55 

atcaaataga aagcctgtat agagttcaag tgagcttaat acttttattt aaaatagaaa 100380 

taacatgatt ccattcttat agaatgatgc tcctctctct tggttattct cttttttccc 100440 

60 tctcacggca tctttccaca catccattct tctgcctttt ccctgtacac agagagctag 100500 
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gataatagtc 


atcaaacatt 


aattttggtt 




gttggtttct 


gtattgcttg 


acttttttaa 


5 


acgaggccat 


cacaaaggaa 


aaaaactacc 




tacttaccaa 


aaggaaagac 


ttctggcatt 


10 


ctctggaaca 


agcatttgat 


gatactggaa 




aatagcacat 


ttgatgagtc 


aataactttt 




caaaatttca 


ctatttattt 


atgatttcac 


15 


tgatcttctc 


cctcttggag 


ttcttcgtag 




caaacaccac 


aaccaatatg 


gtgagttggg 


20 


cctggagaaa 


tacagacttg 


tgtctgctaa 




tgtccttcac 


tctgatggta 


ccaggggaaa 




gtttttattc 


attctttcca 


gtctgtcctg 


25 


gtgacaccag 


cgtcttcttc 


agctcctccc 




aaatagtaaa 


agaaaaaggg 


gtatcagtct 


30 


cttcttaaga 


agatgtcttt 


tattgtctac 




gagatttgtt 


tttaggttgg 


tcgctaatga 




acattaccac 


tactacatgc 


tggcaaaggt 


35 


caactctctt 


aaagttagaa 


atgtttctgt 




ttagaaacaa 


aagccttttt 


cagtatgttt 


40 


tagagcagta 


gatctgatct 


atactccaag 




ctaggtgtcc 


agtcactgga 


gttccctaac 




gcaagtgtcc 


aattgccctt 


gttggaggtc 


45 


atcttgcctc 


ttaactatca 


tctacaaaat 




ttaaactcac 


ctgcaatggg 


acaacagtca 


50 


ttgagttttg 


gctttttgtt 


tttgaattgt 




aaggtaagac 


agaagggatg 


aatgacagtg 




aaaacatcac 


ctaagtcatg 


aaaaagggca 


55 


tcacagatat 


ccacagcttt 


gcaatagtgg 




atccctgtca 


aagcagtagt 


gatgtcatgc 


60 


gttgaacaga 


atggtaacaa 


gggaaaagtc 




acaggattgc 


caagcggtac 


agaaggcaca 



PCT/US01/48437 



atttctaaac 


actggcattt 


ggaatgatcc 


100560 


tgagtagtaa 


catttgtatt 


aaaaaaaaca 


100620 


ttctttctaa 


tatttctaaa 


caaaatagcc 


100680 


ttaaagaaaa 


tccaaaataa 


attcacactc 


100740 


gaataagcat 


ggagcgagtt 


gagtataagc 


100800 


tgaagcccac 


gtgaggccct 


cctgattagc 


100860 


tttctggaaa 


aggcatttca 


gccacgctga 


100920 


cttgtgccac 


agcccatttt 


gccaaccaag 


100980 


tctcttcttt 


gtaaaataat 


ctgaaaatgc 


101040 


atcctactat 


cggcttacca 


gaatgctctt 


101100 


ttgcatagga 


ttgtagtaac 


catattgctt 


101160 


gttattccaa 


atatgtatga 


aagcaaccct 


101220 


agatgcaaca 


actactcagc 


taatgcccct 


101280 


aatctcatgg 


agaaaaacta 


cttgcaaaaa 


101340 


aatgatttct 


agtctttaaa 


aactgtgttt 


101400 


tggctgtatc 


tcccttcact 


gtctcttcct 


101460 


gaaggatcag 


aggactgaaa 


aatgattctg 


101520 


tcatattact 


ttttccttaa 


taaaatgtca 


101580 


aaaaaattaa 


ttttgatttg 


atattggttg 


101640 


aagattgata 


aacctttgat 


gatgattttt 


101700 


tcatctcctc 


ctcatctttc 


tgttcagaaa 


101760 


atattattct 


atacattaat 


atcatcaaac 


101820 


tttgctattt 


acagagtatt 


ttcatgtaca 


101880 


gacagggatg 


ctgcattgag 


gggttttcat 


101940 


tggcacattt 


ataagccaat 


tagaacaatc 


102000 


taagtttcaa 


gaaatcaaaa 


gcacagatgg 


102060 


gtttctctac 


aaaatagaaa 


ggtataatta 


102120 


cagtgaggat 


ccgatgtgat 


ggctcatgtc 


102180 


ctaaggtaag 


ggcacaggta 


gagtcagaat 


102240 


atggaactgg 


agaataaagt 


ggtcggagaa 


102300 


tttgaggatg 


tggtaacaaa 


ttcagagtat 


102360 
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ggacaatgct tctgcaaggc cttgctgcta gatccaggca actacggatc caggaaacta 10242 0 
tttgattttt tccagggtta agattttgct agaggagatg acaaaatagc agaagagcaa 1024 8 0 

5 

gaaagcttag gacattgtaa aaaatattat ttaaatgatg gactgtggaa tctaaactgg 102540 
ataaggaagg aataatagta ataataaaag caataataaa gaaaaaagac agattgatca 102600 
10 taaatggaga aatctgcctt ttctccggga gctgtcgtga caatcaaatg agataatgtt 102660 

atatgaatat ggtttaaaaa tagcaaagta ttgtcccttt ttgaaatttt aatattatta 102720 
aagaagccag aagggaaaaa tataacaaaa agagtaggca gagaaatcaa aatcagggaa 102780 

15 

gaggaagatt agaagaacag aatttttgca tctgggaaag cagatgtatt atgtcaggca 102 84 0 
agagtcacat aatttcaacg tcattggtct gcctaggtcc caactattca gcaactgggt 102900 
20 gattggaaat gctgtgtcct tcaccataaa gctcaacagc tcagcaaaag tgaagacaaa 102960 
gatttcatgg tctaagggtc tggaaaataa atggtgcctc aaaatgtggg ggaggggatg 103 02 0 
gtatattgta tatctataca cgtaaaataa acaaatataa aagtcactta ctttagggaa 103080 

25 

cagaagcatt tccagcaagt tcattttt.ta agtaaatttc tatcaaatat attctatttt 103140 
tcaaattgct taataatagt aataataaca gcatatattt tagcccattc cttgtaccac 103200 
30 tcacgttcca agcactgttt caatgctcat gacaaccacc tgtgaggcag gtggatcatt 103260 
ttcagatgaa aaaactttgg cagaaaagtt aaaaatcttg tcctaagata tgtagctagc 103320 
aaatggtgaa gcatggattt gaacccaggc agtatgtttc ccaggtccgg gcagttgccc 1033 8 0 

35 

taaccattat tcccactatt cttatggtct tggatagaat cagacacttg taatgccaaa 10344 0 
acaaacagaa atgtttctct atagtaagct tgtaataaat ttggaagaca ttgccttatt 103500 
40 ctctgagaga ggagatttag taacataata atacagttga cacttaaaca acatgagttt 103560 

gaactgcgtg gctccactaa tgtgcatatt ttttcaatgt gggaaaagtc atggaaacgg 10362 0 
agaataaact gatcagagaa acagaggatt accaagctat actgagggta catttgaggg 103680 

45 

ggttttagca aatttatact atgaacaaat ttctccagct gggcctgtga tctgagaccc 103740 
tagaaggtga ctggaataga aatatgatat tgccctctcc ctctccctcc ccctccccct 103800 
50 ccctcatgcc cagccgaggc tggactgtac tgccaccatc tcgactcact gccacctccc 103860 

tgcctgattc tcctgcctca gcctgccaag tgcctgggat tgcaggcatg tgccgccacg 103920 
tctgactggt ttttgtattt tttggtggag accgggtttc gccgtgttgg ccggtctggt 103980 

55 

ctccagctcc tgaccgggag tgatctgcca gcctcggcct cccgaggtgc caggattgta 10404 0 
gacagagtct cgctcactca gtgctcaatg ctgcccaggc tggtgtgcag tggcgtgatc 104100 
60 tcggctcgct acaacctcca cctcccagct gcctgccttg gcctcccaaa gtgctgagat 104160 
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ggcagcctct gcccggctgc caccccgtct aggaagtgag gagtgtctct gcctgg'ccgc 10422 0 

ccatcatctg ggatgtgagg agcccctctg cccggctgcc cagtctggga agtgaggagc 104280 

acctcttccc ggccaccatc ccgtctagga agtgaggagc atctctgccc ggccgcccat 104340 

cgtctgggat gtggggagca cctctgcccc gccgccccgt ctgagatgtg aagagcgcct 104400 

ctgcccggct gcgaccccgt ctgggaactg aggagtgtct ctgccccacc gccaccccat 104460 

ctgggaggtg aggagcgtct ctgaccgccg ccccgtctga gaagtgagga gcccctcggc 104 52 0 

ccagcagcca ccccgtctga gaagtgagga gcgtctccgc nnnnnnnnnn nnnnnnnnnn 104 58 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnaaaaa aaaaaaaaaa 104 64 0 
aaaa 104644 

<210> 80 

<211> 95109 

<212> DNA 

<213> Homo sapiens 



<220> 

< 2 2 1 > genomic_DNA 

<222> (1) . . (95109) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (60039) . . (60257) 



<220> 

<221> coding_region 
<222> (62556) . . (62678) 



<220> 

<221> coding_region 
<222> (66166) . . (66225) 
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<220> 

5 <221> coding_region 

<222> (67828) . . (67959) 

10 

<220> 

<221> coding_region 

15 <222> (74200) (74313) 



<220> 

20 

< 2 2 1 > coding_region 
<222> (74489) . . (74590) 

25 





<400> 80 
nngtttcaaa 


ttcctgggct 


caagtgatcc 


acccacctcg 


gcctcccaaa 


gtgctgggat 


60 


30 


tacaggtgtg 


agccaccaca 


cctggcccct 


cataaacttt 


ttgtaaagcg 


tcaatagttt 


120 




atgaattcat 


tagtcttcca 


cccgggcttt 


gacatcaatt 


ttatgcttgt 


tctttcttca 


180 


35 


attttagcag 


aattcatgtt 


tctctggtag 


gggctctttt 


caaactgact 


tcttatcctt 


240 


ct.aagtgcct 


caaactaaat 


cctgttaaga 


catgttataa 


caattttgtg 


tgagtttatt 


300 




ttggtgcaaa 


aattttttga 


aatccgtgcg 


tagtttttcc 


atagtatgea 


tttcccacaa 


360 


40 


actttttgaa 


gaccccttgt 


attaggttga 


accatttgaa 


attgetgata 


tcagaccatt 


420 




tttaatctac 


aaaattgtca 


gtattgtatg 


atccaatcca 


atgtattaga 


ctattccaat 


480 


45 


atatgcaatt 


tggaatatga 


tccttgtgaa 


atcaagagga 


aaataataac 


atagatatat 


540 


aggttttaga 


atactatgaa 


ttaccaatat 


tgacacagtt 


tttgcactaa 


gtttcctgat 


600 




aggcaaagtg 


aaaatgaaat 


tcaatcatta 


tgtgaataaa 


ttatctatta 


ggcagaaatg 


660 


50 


tttcagttca 


ttatggataa 


aatgattcct 


gtaatagtac 


tttataacca 


tttattcgtg 


720 




tatagttcag 


tataataagc 


agtatataae 


agtatataae 


agttatatat 


atactgttat 


780 


55 


atactttccc 


ccgttctata 


attaaaaatt 


agaatacatg gcggggcacg gtggctcacg 


840 


cctgtaatcc 


caacactttg 


ggaggctgag 


gcaagtggat 


cacgaggtca 


ggagattgag 


900 




accatcctgg 


ctaacacggt 


gaaaccccgt 


ctccactaaa 


aatacaaaaa 


attagceggg 


960 


60 


cgtggtggca 


ggcgcccgta 


gtcccagcta 


ctegggaggt 


tgaggcagga 


gaatggcgtg 


1020 
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aacccaggag 


gcggagcttg 


cagtgagctg 


agatcgggcc 


actgcactcc 


agcctgggcg 


1080 




acagagtgag 


actccgtctc 


aaaaaaaaaa 


aaaaaataga 


atacgtaact 


ctggttacat 


1140 


5 


tttctgtttg 


gagatactga 


gcattctcag 


tctggatctg 


gttgtcttga 


gttaaggaat 


1200 




tatttcaccc 


aagaaaactt 


ccaggagtgg 


tgacagattc 


tccaaagtag 


acataataag 


1260 


10 


caagatggat 


gcaaggagga 


agcataggtt 


gatagcgtaa 


tattttaaat 


gctgactcaa 


1320 


agaaatcatt 


atttcattac 


taagagtaaa 


attagcgtga 


agtataagca 


gccccatagc 


1380 




ctagtcttct 


ctagatcaat 


aatagttcta 


aacatactgt 


actctaacta 


cattatttct 


1440 


15 


gagtaatttc 


tcctgaatga 


acacttgaaa 


tttgcataac 


ctgccattag 


agagaaggca 


1500 




tacttcaatt 


atccatattc 


ttataaagtt 


gagactgcat 


aaactaatgc 


ccactaggtt 


1560 


20 


aacacgaaat 


gttttaacaa 


cagctcagcc 


ttcagttttc 


tagaaggtcc 


agtttccaaa 


1620 


aaattatagc 


attctttatg 


cctaatgtcc 


tagctacaaa 


taatgatact 


gcatagtttt 


1680 




ttcacaatat 


gaattttatg 


aaatatttta 


ctgagattat 


ctcattagat 


ctgttaaatc 


1740 


25 


tctaccctat 


ttttatttcc 


tttatctcta 


cactctactc 


tcaaatatct 


cccttcatga 


1800 




ggaaccattc 


ctttgtattt 


aatataaatg 


gttttttaaa 


accacttttt 


aattttaagc 


1860 


30 


tactttatat 


gattaataca 


taaaaatgta 


tacatatgta 


atgcatagat 


ctgagtgagt 


1920 


ttggggatag 


gtatacacct 


atgaaactat 


caccaccatc 


aaggccataa 


acacatttat 


1980 




cacatcccga 


aggttcttcc 


cacacccttt 


attattattg 


ctttttactt 


tttctttggg 


2040 


35 


tattaacatt 


taacacaccg 


ctatcccctt 


agcaatttta 


agcacacaat 


acagttttat 


2100 




tcaccacagg 


cactatgcta 


tagagtagat 


ctccagagct 


tagttatttt 


gcataactaa 


2160 


40 


aatgttgtga 


cttttaacca 


tcatcttccc 


atttatccct 


cccacaagcc 


actgacagct 


2220 


actattctac 


tctctgcttt 


taagagtttg 


gcgattttag 


gctattatct 


aaaagaaaaa 


2280 




tacagcagat 


gctggtgagg 


ttgaggagaa 


aagggaacac 


ttctacactg 


ttggtgagaa 


2340 


45 


tgtaaattag 


ttcagccact 


gtgcaaagca 


gtttggagat 


tgctcaaaga 


actttaaaca 


2400 




gaactaccat 


ttaactcagc 


aatcccatta 


ctgagtatat 


atccaaaagg 


aaataaatca 


2460 


50 


ttccaccaaa 


aagatatatg 


cattcacaag 


ttccatcaca 


gcactattca 


caatagcaaa 


2520 


gacgtgaaat 


caatccaagt 


gcccattaat 


ggtggattgc 


ctaaagaaaa 


tgtaatatat 


2580 




acaccatgga 


atactacaca 


ggcataaaaa 


agggtgaaat 


catgtccttt 


gcagcaacat 


2640 


55 


ggatgcagct 


ggagaccatt 


atcctaggca 


aattagcaca 


gaaacagaaa 


accaaatgcc 


2700 




gcgtgttctc 


acttgtaagt 


gggagttaaa 


cattgtgcac 


acatggacat 


aaagattgca 


2760 


60 


acaatagaca 


ctaagactat 


caaaaggggg 


aggaagaaag 


ggggaaaggg 


ttgaaaaagc 


2820 


atagtgggta 


caatgctcac 


tacctgggtg 


atggaatcaa 


tcatacccca 


aacctcagca 


2880 
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tcacccaata tacccatgtt tcaaacctgc atgtgtaccc ctgaatctaa aataaaattt 2940 

gaaattataa aaaataaaat tatgtaactt aaaaaaatat tgctgagatg gataaataca 3 000 

taaataaaat ggaaaaaaat ttaagagttt agattccaca tagtgagatc atacagtaca 3 060 

gctatttgtc tttctgtgtc tggcttattt cacttaatat cctccagatc cattcatgtt 3120 

ctcacaaatg gaaggatttc tttcttttct aaggccagat tacacacaca cacacacaca 3180 

cacacacaca atttattcac tcatccattg gtaacattta gcttgttcca tatctgacta 3240 

ttacgaataa tgctgcaatg aaaatggaag tgccaaattt caaatccttt ggctatatat 3 300 

ccaaaagtga gattgtttga tcatatgtca gttctatttt taacttttct agggacctcc 3360 

atattgtttt caataacagc tatagaattt acattctcac caacagtgtg taagtgttcc 3420 

ctattcacca catcctcacc agcacttgtt atcttttgat ttttaataat agccatccca 3480 

acaaatgtaa gatgctatct cactatggtt ttgatttaca tttccctgat tattagtgat 3540 

gttgagcaca tttgcatata cttcttggcc atttctatga cctcttttaa gaaatgccta 3600 

ttgagaaagt catggaataa acttaagtgc ccatcaataa atgattggat ttttaaaatg 3 660 

tggtgtgtat atacatattg tggaatacta ctttgctgtg gaaaagaatt aaatcatgtc 3720 

ttttgcagca acatgaatga aactgcggcc attatcctaa atgaaatgac tcagaaacag 3780 

aaagtcaaaa accacacatt ctcatttata agtgggagca aaaaagttgt tacacacgga 3 84 0 

catacagagt agaataatac aaattggaaa ctccaaaagg tgagagtgtg gaagggagtt 3 900 

gaggaatgaa atactaccta ttgggtacag tgcacagtat ttgggtgatg agtacactaa 3 960 

aagccgaggt ccagacttca ccacttacaa tatgttcatg caacagaact gcacttgtac 4020 

cccataaatc tattttttta aattgcccat ttttaaattt ggttgcttta ttttttttat 4080 

tctgttgtat gtgtcctttg tatattttgg atattaaatc ccttatcaga tatatggttt 4140 

gcaaatattt tctctcattc cataggttgc tttttcattg tttttcttac tgtgtggaag 4200 

ctttttagtt tgatgtagtc ccacttgttt atttttgctt tgttgcctat gctttacagg 4260 

tcatatccaa aagacatcat tgccaagata aatgttaagg atctttctct ctgtgttttc 4320 

ttctaggagt cttagggttt cagattttaa aataaattac tttattcatt tttacttaat 4380 

ttctgtgttt gatataaaat aaggctccaa ttcatccttt tccatgttaa tatccagttt 4440 

tactaacaac atttattgaa gagactaccc tttccccatt atgtagtctt tctgcctttg 4500 

ctgaaaatta gctggccatc tatgcatggg tttatttctg ttctctctgt tctgttccat 4560 

tagtctatat atctgttttt atgtcagtac cactctgctt tctttactat agctttgtaa 462 0 

tatagtttga aatcaggaag tgtgatgcct ccagctttgt tcttgctcaa gattgctatg 4680 
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gctattcaga gtcttttgtg agtccataca 
aagtatactg cttttgatag aatagacatt 
5 catgggatat ctttccattt atttgtgtct 

ttttcagtgt acaactttgg catttccttg 
ttctctgtta aacatcattt tgtgtgtgtt 

10 

ctatgttctc ttgcagctca ctgatgttct 
gcagttcata gatatccaat tttaaaggtc 
15 tggtgtccta tttccttgat actttgtgtt 

ttgaagaagc agtcaatgtt tctaatcttt 
acagtgagcc cagccagaga ttcttggggc 

20 

ctatttctct tgttccttct tagtgggcaa 
aaagtcaggc ctgatgctga gagcctgcca 
25 tttaaaggtt gtgcatcttt tttttttttt 
cactctgttg cccaggctgg agtgcagtgg 
cccaggttca agagattctc ctgcctcagc 

30 

ccaccatgcc cagctaattt ttttgtattt 
caggatggtc tcggtctcct gacctcaggt 
35 gggattacag gcgtgagcca ccacgccctg 

gccagttggt aagatccaca gctgctgttg 
ctgagctgtg catgggcatg tgtggggtac 

40 

agaatgtgtt ggctatttgg gaagccctct 
gcctcttggc ggagttcaag agctggttat 
45 aatgtgtttg taaaacatct ttggttttca 

tgagcaggta tcatctccac tctacaaaag 
agtgctgtac ccaccacatc ttgcagagga 

50 

tgctgcttct aagcgggggt gtgtctgggg 
gccgacagat caagcacagc ctcttcctcc 
55 atgcccacgc agccaggaaa gagcggcatc 

ggagcctgca ggctccgtgg tgccacggga 
tctctagcac agccccgctc ttgggcgtta 

60 

ccgaggaagc agaccccacc gggctcccga 
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aattttaaga 


ttgttttttc 


tatttctgtg 


4740 


ttaacaatat 


tagttcttcc 


aatctatgaa 


4800 


actttaattt 


ctttcatcaa 


cattttatag 


4860 


gttaaatgta 


tgccagtgtt 


ttcacaattt 


4920 


tattgttttc 


ctggttttgt 


taaattgtct 


4980 


ttaagatgat 


tattttgaat 


tctttgtcat 


5040 


aaccacaaga 


gcttcacttt 


gttcttttgg 


5100 


cctcaaagct 


ttgcattact 


gtcttcacat 


5160 


actggttttc 


ttcagtagag 


aaagaccttc 


5220 


ctcatgtact 


ttttctataa 


tgtgcacact 


5280 


gtctcaggat 


tgtgtgcctt 


ctttcaatcc 


5340 


tttattttcc 


ctaaggcaat 


gtcctgaagt 


5400 


tttttttttt 


ttttttttga 


gatagggtct 


5460 


cacaatctcg 


gctcactgca 


acctctgcct 


5520 


ctcccaagta 


gctagggcta 


caggggcatg 


5580 


ttagtagaga 


tagggtttca 


ccatgttagc 


5640 


gatccgccct 


cctgggcctc 


ccaaagtgct 


5700 


cctcatcttc 


tttcaaagca 


gcagtgtcaa 


5760 


agatccaagt 


gctgtctgtg 


ggagctcatt 


5820 


tattcaaggt 


ggggttccca 


gggtgctagg 


5880 


gacaagtcat 


tctgcagggt 


ttatgggcag 


5940 


cccagtggaa 


tatgtttgta 


attgcagtgg 


6000 


aagttctgca 


gcatgagcat 


atgaggtacc 


6060 


aataaactga 


ggaccaaagg 


atttaagtgc 


6120 


ttctaagtgg 


caaggagctt 


ctgacaatga 


6180 


ttagcagtgg 


ggccaccacg 


tgatcaatcc 


6240 


tgggtgcgcc 


aggcccaagg 


acagaggatg 


6300 


attgtcagcc 


cctaccccac 


cctcgtcttg 


6360 


gccccaccgc 


acctcacgta 


tccgcgcgag 


6420 


gaaccccgcg 


aggggggatg 


aaggacgcca 


6480 


ggctgccagg 


ccccccgctc 


agggcgctgc 


6540 
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ctgcagaacc acgcccccct gtccctgtgc 
ggcctggtta ccattcgcga ctccctcgtc 

5 

ctagtaacca tggtatagtt gagacgatcc 
aggcgtctag gaggcttacc tggcattgcc 
10 aaacaccagc ggggctgttg gccttgtgag 

gttctcagca tctgcccaat ttgagctcct 
ttcctaatgg ttcaggtggc cataagatat 

15 

aagatctctg tttatcacct tgagatgttc 
tcacatagcc tagattttaa gtcgttgcat 
20 agcttgctct ttaggacagt agtaataaaa 

gttacatact agacactttt ctaaattatc 
ctcctatact aagacctcaa atagcatgaa 

25 

actgcaggaa cacatagaaa cttcgttgta 
ccaaaaaaaa aaaaaaatgt accatgcaca 
30 tatgacaaaa acatacaaaa aaatgactta 
tggagtatgg tttgagaaat tagcctggaa 
cttgtatgtt tgcagactat tctaaagtca 

35 

tatgaagaat tgctgtggct gagatgtggg 
atgaatatta gaggacagat gaccgaataa 
40 tgagctcctt gcgaactgag ccagtgatct 

cctcaatata gtaacagatt tggatttgga 
tggttcagct gggttggact ggctttgggg 

45 

gcagtgaggt attcaagttt aacaatactt 
cactgaggaa tagaagttta aattaggaat 
50 gctcatggag aaggttagag gagagttcca 
tcattcctaa gatataagac ttatgtcagt 
tgacatcagt gaaaatggca cagtggggaa 

55 

ttcagaagca acattctcac aactctgaaa 
gcttgatcaa gaaaacacaa ggaaatgttg 
60 cctagtctca tttcccactc cccagctcgt 
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taagagggtg 


acgcgagtgc 


ccgaagccct 


6600 


ctgggttctc 


ccattgccag 


gaatgtttta 


6660 


actcttgtcc 


tacgcatttc 


ggcacaatac 


6720 


tgagagaatc 


ttccatccat 


ttaagtaagt 


6780 


ttgtggcacc 


gtgaaaaacc 


aaggcctgca 


6840 


gtttgaaacc 


actgttgggt 


ttcccttctc 


6900 


cctcttaagc 


ttgagtgtca 


gttattttat 


6960 


tgcagggtcc 


ctgaagtgaa 


tttcgtgtat 


7020 


tccagactta 


ccattcaaat 


ttggagtttc 


7080 


ctacaatcat 


gactacattt 


attgcacaat 


7140 


tattaatact 


aattttatga 


atgccatgag 


7200 


atgaaatcca 


gtcagtttag 


ctgtttgcaa 


7260 


cttatacatt 


tatttgacag 


tcactgagct 


7320 


gtgcttagaa 


tctacagtta 


caaagatgaa 


7380 


ttggtggaac 


tgaaactgac 


cagaaatggc 


7440 


aggaagctgg 


agggcacatc 


gtgaaagcct 


7500 


gtagggagcc 


tctaaaatgg 


tatgatgata 


7560 


gaataggctg 


aaggagcaat 


gaaatgaata 


7620 


gaaatcttca 


taatagtcca 


agcaaaaaaa 


7680 


ttgtctacca 


tttctatctc 


tacacccacc 


7740 


gttttccatg 


ttttaatggg 


ttggttgatt 


7800 


gaagactaaa 


ggggaggaag 


ttggaaggga 


7860 


cagttggatt 


tctgggcacc 


attttaattc 


7920 


gggacaaaag 


ccaagtgcag 


gatatggaaa 


7980 


atgcagtgag 


catgtgggaa 


cccaaaagac 


8040 


gaaagtaatc 


taaaacgaca 


tggttagagg 


8100 


caccaagcaa 


ttaataagct 


ggaaaaaact 


8160 


attaacgaaa 


acttgcagca 


accaagggat 


8220 


gaaacagatt 


tttgtgacat 


tttaatttat 


8280 


cagtgccctt 


gaataaaaca 


gccttcattc 


8340 
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tctgtatagg tacccagtaa cagagggagc 
ctgcttgtta gatctacctg gtggttccat 
5 ctcgcctaaa acagaactat cccggagctc 

taaaggaaaa aacataatag tcattgattt 
aatagactaa ctgaaaagtt tgggagaaaa 

10 

actcccacat atttctgata atctagaagg 
tcagaaaaga cctgaaaaca tcctaaggtc 
15 aagtaggaag taaaggataa gcagagtggt 

atggtgtgag aaaagggtcc aatttcatta 
accatttatt gaagaggctg tcctttctcc 

20 

tctgttgacc ataaatgtgc aggtttattt 
tgagtctgtt ttcatggcaa taccatgttg 
25 caagattata tagtgtgatg acttcagctt 
gttggggtct tttgtgattc cacacaaatt 
agattgaatt ttgatagaga ttgcattgaa 

30 

ttcaacaata ttaactcatc caatccatga 
ttcttccatt ccttttatca atggtttgta 
35 ggttaaattt gtttctacat attttttata 
attttttgga tagttcattg ttagtacaaa 
tttctttttc tttttttctt tctttttttt 

40 



50 



60 



tgtcacccca 


ggctggaatg 


cagtggcatg 


ggttcaagtg 


attcttatgc 


ctcagcctcc 


catgcccagc 


taatttttgt 


atttttagta 


ggtctcgaac 


tcctgacctc 


aagcgatcca 


atagacatga 


gccaccacat 


ccagccgcat 


tttgcttatt 


agttctaata 


gttttttggt 


catgtcatca 


tgtctatggg 


gttttctata 


tgtgtccagg 


aagtggaaca 


tggagtcaag 


attctttacc 


cagttgagtt 


tttctttgtt 


tatcctatgc 


ctgtcacact 


attgtgtttt 


cccacagctg 


gagggaaact 


tatctcagga 
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aaagcaaatc 


ttatcctcaa 


agaactgcag 


8400 


agagaaactg 


ctcagagaac 


ctgcctttac 


8460 


aggtaagcag 


tggcatttgt 


caaaaacatg 


8520 


ctgaggcaag 


gaataatggt 


tgggacaaat 


8580 


ggccagggaa 


tgagatgctt 


tgagtaacaa 


8640 


ccacagacat 


gccctgggct 


agacacatgc 


8700 


tcaccttcgg 


ataatgttca 


gactctgcac 


8760 


ctttaatgca 


ttttgagttg 


atttttgtat 


8820 


ttctgcatgt 


ggatacccag 


ttgtctcaac 


8880 


attgtgtgct 


tttggcatct 


ttgccaaaaa 


8 94 0 


gtgagttctt 


cagtctgttc 


cattggtcac 


9000 


ttttgattat 


tatagctttg 


taatatattt 


9060 


tgttcatttt 


gctgaagatt 


gctttggcta 


9120 


tcagattttt 


tctctatttc 


tgtgagaaat 


9180 


tccatagact 


gatttgggta 


gtatggatat 


9240 


acatgaaata 


gcttttcatt 


tatttgtgtt 


9300 


gttttcagta 


tataaatcat 


ttaactcctt 


9360 


ctactgtaac 


tgggtttgtt 


tccttaattt 


9420 


gaaaaccttc 


tgattcttgc 


atgttgattt 


9480 


tttttttttt 


tttgagatgg 


agtcttgttc 


9540 


atcttagctc 


actgcaacct 


ccacctccca 


9600 


tgagtagctg 


tgattacaga 


tgtgtgccac 


9660 


gagacaaggt 


ttcaccatgt 


tggccaggct 


9720 


cctgccttgg 


cctcccagag 


tgctgggatt 


9780 


gttggttttt 


aaccttgaaa 


ccttactgaa 


9840 


ggaatctatg 


ggattttcta 


tatataggat 


9900 


cataggattc 


atttcctatc 


tggatgccaa 


9960 


caagattatt 


ctgtagactt 


gagatttaac 


10020 


gctatttctc 


ccttttagaa 


tgaaaatatc 


10080 


ggaagcacaa 


aacttattcg 


atttcacagg 


10140 


tgaatcatgc 


cttgaatctc 


atgcctatat 


10200 
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ctgatttaga tgatactctt gactttggac 
tttgggtctc tagggataga atgactgcat 

5 

ctatgggata aatgctatgg tttgaaggtt 
ttgccgtttt atcaatatta agaggcaaga 
10 tgctctcatg aatggattaa tgccttatca 

ttcattataa aaggaagagt ttggtcctct 
cctttccacc atgggatgac acaacagtaa 

15 

tagacttccc agcttccaaa actttgagcc 
acagcacaaa atggactaat acaacttgag 
20 atgttccatt agtacttgag aagaatgtgt 
atttgtatat taggtctatt aggtctacag 
ttttctgtct ggatgatctg tccattgctg 

25 

ttgaattgca gtttattgct cccttcatat 



35 



45 



tatggtgctc 


caatgtcggg 


tgcatatata 


gaccccttta 


tcactatata 


attactttgt 


tcttgacaca 


tataagatag 


ctacccatgc 


tccatccctt 


cacttttagt 


ctaagagtgt 


tccatagagt 


tgggtcttat 


ttgtttattc 


tttaatctat 


ttacatttaa 


ggtaatgttt 


gtatttgttt 


actgactgtt 


ttgtagttcc 


gtgatttgtt 


gaatttttgt 


agtgatatgc 


gctataggct 


ttctttgttt 


tgttttgggt 


tagggtacat 


gtgcacaacg 


tgcaagtttg 


tgctgcaccg 


atcaactcat 


catttacatt 



55 



agccccctga 


caggccctgg 


tgtgtgatgt 


ttcaactctc 


acctatgagt 


gagaacatga 


tgctgagaat 


gatggtttcc 


agcttcatcc 


tttttatggc 


tgcatagtat 


tccacggtgt 


tcattgatgg 


acatttgggt 


tggttccaag 


acatacgtgt 


gcatgtgtct 


ttacagtagc 
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ttttgagctg 


ttgctggaac 


aagttaagac 


10260 


tttgtatatt 


aggacatatg 


ttttggggga 


10320 


tgtcccctcc 


caaatcgtgt 


tgaaatttag 


10380 


cctttaagag 


tgattcatcc 


atgatgtctc 


10440 


ctggagtggc 


tttgttattg 


agggaatcgg 


10500 


tttgcctctc 


tatctctcac 


catctctttg 


10560 


gacccttgcc 


agatgccaac 


tcctcaatct 


10620 


aataaattac 


acagtctgtg 


gtattctatt 


10680 


aggagcctaa 


tatatgatgt 


atcctggaga 


10740 


attctgctac 


tattagaaga 


aatgttcttt 


10800 


tgtagttcaa 


gtccaggatt 


tccttaatga 


10860 


aaagtgggat 


attgaagttg 


cctactatta 


10920 


ctattaatat 


tatatatatg 


cacacattta 


10980 


tttacaattg 


ttatgtcctt 


ttgatgaatt 


11040 


ctctttttac 


gattttttac 


ttaaagtcta 


11100 


actggtctcc 


ctttgtgtgg 


aatatctttt 


11160 


ctttaaaggt 


gaagtgagtc 


tcttataggt 


11220 


attcaaccac 


tctgtctttt 


gattggagaa 


11280 


gacaagtgaa 


gagttagtat 


tgccattttg 


11340 


tttgttcttc 


caatcttgct 


ggtttccttt 


11400 


tttgattttt 


tttttatctt 


tttcacatct 


11460 


tttttttggt 


ttttctaata 


ctttaagttc 


11520 


ttacataggt 


acacatgtgc 


catgttggtt 


11580 


aggtatttct 


cctaatgcta 


ttcctccccc 


11640 


tccccaccct 


gtgtccacgt 


gttctagttg 


11700 


ggtgtttagt 


tttctgtcct 


tgtgatagtt 


11760 


atgtcctggc 


aaaggacatg 


aacttatcct 


11820 


atatgtgcca 


cattttctta 


atccagtcta 


11880 


tctctgccat 


tgtgaatagt 


gccacaataa 


11940 


atgatttata 


atcctttggg 


tgtataccag 


12000 
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tagtgggatt 


gctgggtcaa 


atggtatttc 




tgtcttccat 


aatggttgaa 


ctagtttaca 


5 


ttgtccacat 


cttctccagc 


acctgttcgt 




ctggtgtgag 


atggtacctc 


attgtggttt 


10 


atgagcattt 


tttcatgtgt 


ctgttggctg 




tcatatcctt 


tgcccacttt 


ttgatggggt 




tttgtagaat 


ctggatatta 


agccctttgt 


15 


attctgtagg 


ttgctgctat 


aggctttctt 




tcttgtagat 


atacctattt 


taagctgatg 


20 


ctacactttt 


acaccccata 


ccacatttta 




aaagacgaaa 


ataatgggag 


tgggtcagct 




taaggtcatt 


acagaagaat 


aaaatctaag 


25 


ttgctgttta 


gcagagaata 


gtgtcagtgg 




gctaagcagt 


ggaaaatcag 


aggcttccat 




gtattacttg 


ttctctatat 


ttctctatag 




tatacaatat 


ctctgttaaa 


atgctgagaa 




tgggggtaag 


gactaagtgc 


tacttgggtt 


35 


agaaaaagga 


aaagacattg 


accacatctt 




cctaaggtga 


tgataactag 


cctaaagcta 


4U 


acattccgga 


atagttcaac 


gttaccacaa 




tgtttgctga 


tatatgagag 


gtccattttc 




gctcatcaat 


cataaagtgg 


tcaggtaatg 


45 


ggggcactgt 


actgtcaaca 


tgtgcccctt 




tcaatgatct 


cccgctcagc 


tctcctcaat 


OO 


aatatttgtg 


ggaccttgaa 


tcacatttgc 




aggaaaattc 


tcaacgtatc 


aaatatatta 




acactatcag 


aatgcagggc 


atacctacat 


55 


taacgatgtt 


agctagggtg 


atcaatccag 




acaatgaata 


gaggcaaaaa 


tgaatatgtc 


60 


ttagtactgt 


catgccccat 


gattaaggtc 


aagcctatga 


atggcccaga 


ctcttcagga 



PCT/US01/48437 



tagttctaga 


tccctgagga 


atcgccacac 


12060 


gtcccaccaa 


cagtgtaaaa 


gtggtcctat 


12120 


ttcctgactt 


ttttatgaac 


accattctaa 


12180 


tgattcgcat 


ttctctgatg 


aacagtgatg 


12240 


cataaatgtc 


ttcttttgag 


aagtgtctgt 


12300 


tttttttttc 


ttgtaaattg 


gtttaagttc 


12360 


cagatgggta 


gattgcaaag 


attttctccc 


12420 


ttgtatttac 


catgaggctt 


aaataaaaaa 


12480 


acaacccaac 


ctcacttgca 


tacaaaacct 


12540 


catttttctg 


ttacagttta 


cttctctagg 


12600 


acatcaaaag 


gtatctactc 


agtctcattt 


12660 


aggtagcttt 


tggcttttgc 


caggaagtta 


12720 


gttgatgcaa 


aaagcttttt 


gcaaggaatt 


12780 


ttcaagtgcc 


ttatctcaaa 


ataatcatta 


12840 


caaaggagtc 


cattcatcct 


ctataactgc 


12900 


gatttatcct 


aaaagaaggc 


accaaagcaa 


12960 


aataaatgac 


tagtgttgct 


tcttcctaat 


13020 


ccccaaagga 


aggagaattg 


acaaaagtgt 


13080 


attggactca 


aatgtgacgg 


acccaaaaaa 


13140 


tccacccagg 


gttctaacaa 


atacattctg 


13200 


tgcctgtgtc 


tgtaaacagt 


ttgaatcaaa 


13260 


ggcttaaaac 


cagaaaaggc 


tgtagaccct 


13320 


acattcagct 


caggtgggtc 


acattctgct 


13380 


cttagaatac 


tatacccttt 


tgtattatcc 


13440 


atttaagtca 


gcaacattaa 


accttgcact 


13500 


ttatctccag 


ctggttgaga 


cctacccagt 


13560 


gcctctgagt 


caacaggcaa 


aaatgagagt 


13620 


gatgccaagg 


ggaaattgga 


ccactacttc 


13680 


tggaatacag 


gagatccttt 


aagatgtctc 


13740 


aatgggaaat 


tacaacaacc 


caatccaggc 


13800 


ttaaaggttt 


gggttatcct 


gtcaggtaaa 


13860 
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gaaacatgac 


cagccaaggt 


tgeataatae 


5 


gcactgtagc 


ccctttctga 


ggcattcctg 


ggacagaact 


tcaagcagta 


cacgaggttg 




gtgtgtgact 


atatactaat 


tcatgggctg 


10 


cttaaaacaa 

V*-" L~ \— (—4. CI CI W w*. 


atataattaa 


aaaagtggtg 




tagacttctg 


aataggcaaa 


acacataaag 


15 


agagcatgat 


tctcctgcct 


cagcctccca 




tgcccagcta 


atttttgtat 


ttttagtaga 




tcttgaactc 


ctgacttcag 


gaaatccacc 


20 


aaacataaac 


caeca tgccc 


agecagcaga 




ggcccattct 


gtggatacca 


ctcagcctct 


25 


gcccattaac 


aaagtggcca 


tggtggcagt 




ccacatgggc 


ttccactcac 


caaggctgac 




tgccagcagc 


agagatgaac 


attaggttcc 


30 


cctaataaca 

v*i v -»j ^ *j v 


gaatggctac 


at tggaccat 




ttactggaat 


agatacttat 


tctagatatg 


35 


caaaaactac 


catccatgga 


cttacagaat 




agcattgttt 


ttgatcagcg 


aactcacttc 




gctcatgtaa 


tttgctggtc 


ttactatgtt 


40 


aaataatqqa 


atgaactttt 


gaagacaatt 




taactggggg 


aaggttcact 


agcagagtgt 


45 


aaatcaggga 


gtgaaaatgg 


gagtgacacc 




aaaaatttgc 


ttcctgtcct 


gaaacattat 




ggaagaaatg 


cttctaccag 


gagacataat 


50 


cccaac tagc 


cattttggac 


tttttatctc 




tgctggaagg 


ggttatcgat 


actgactacc 


55 


taaggaagtg 


tatattagta 


atacaggaag 




cctgtgatta 


aggtcaatgg 


aaaactacaa 




ccagactctt 


caggagtgat 


ggtttgggtc 


60 


gaggtgcttg 


ctgaaagcaa 


aggaaatatg 



ccaggcacaa 


ccccaaagta 


gacagctgta 


1392 0 


aagaaaataa 


tgaagaaaaa 


tcttcccagt 


1398 0 


tatattttgc 


ttggaaggag 


aaatagcegg 


1404 0 


tggccaatgg 


tgtggctgga 


atgtcaggga 


14100 


acataggagt 


ctagagaaca 


ggtttgtgga 


14160 


atatttgtgt 


cccatgttaa 


tgtccaccaa 


14220 


agtagctggg 


attacaggtg 


cacaccacta 


142 8 0 


gatggggttt 


caccatgttg 


gecaggctag 


1434 0 


cgcctcggcc 


tcccaaagtg 


ctgggattac 


14400 


ggaagatctt 


aataatcaag 


tagataggat 


14460 


ttcctcagcc 


accccatcat 


tgtccaatgg 


14520 


ggcagggatg 


gaggttatgc 


atgagctcag 


14580 


ttggctacag 


tcactgatag 


gtgcccaaaa 


14640 


tgatatggta 


ctattccctg 


gggtgattag 


14700 


ttctgtcatg 


gagagggcag 


tgttttcttc 


14760 


gttttacctt 


ccctgcattc 


aatgcctctg 


14820 


gecatatcta 


ttgccatggt 


aatgccacac 


14880 


agagcaaatg 


aagtgcaaca 


ataagctcaa 


14940 


ccccaccatt 


atgaagcagc 


tggtttgaca 


15000 


acagcaccag 


ctagctggca 


atatcctgea 


15060 


ttctcccaca 


gctaggattc 


acaggtccag 


15120 


actcactatt 


acccatagtt 


acctactagc 


15180 


actgtgcttg 


tcagatgtct 


ttattccaat 


15240 


aatgatttca 


ttgaactgga 


agttaggact 


15300 


tgaatcagtt 


ggcaaagaag 


ggagttattg 


15360 


ataaggaaat 


ttggcctatt 


acttcacagt 


15420 


ccccctaggg 


tatcctcagt 


attaccatgt 


15480 


caacccaatt 


agaacaggac 


tactgatggc 


15540 


aacccaccaa 


gtcaataatc 


acagcctgct 


15600 


gaatgggtag 


tgggagaaaa 


taatcataaa 


15660 
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taccagctat 


aaccatgtgg 


ccagttacag 


aaataaggac 


tgtaattgtc 


atgagtattt 


15720 


ttccttattt 


tgttatgaac 


atgtttgtgt 


gtttgtgtgt 


attacatacc 


tttgttttct 


15780 


tccctctctt 


atccccttat 


tatgtaacat 


aagatgtgtt 


gactttatgt 


tatggctttt 


15840 


aagttgtggt 


atttcaacag 


aagagtaaac 


gtcactcaag 


aattttgcat 


cctcttctgg 


15900 


aaaagtgtta 


gtgcattttt 


tgttgtatgc 


atgacagctg 


ttaggcaaaa 


gtacatcttt 


15960 


ctttggagat 


taagtatggt 


ttcacgagat 


gtgtaagtgg 


actgctaagt 


ggactgctaa 


16020 


gtggacatga 


agttgacttg 


tgatgggtaa 


ttttatgtgt 


caacttatct 


ctgtcaaaat 


16080 


atgcctagat 


agttgttcaa 


acattattct 


ggctgttcct 


gtgagggtgt 


ttgtggatga 


16140 


cattaacatt 


taagtcgctg 


gtctttgagt 


aaagcagatt 


gcccttcata 


atgtgggtag 


16200 


attttattca 


atcagttgaa 


ggtctgagtg 


aaacaaaagg 


ctgaccttcc 


cccctccatt 


16260 


tccctcatcc 


ctcccctcat 


ggagcaagag 


gaattttcca 


gaagactgcc 


ttcaagcttc 


16320 


atctgcacca 


tcagctcttc 


cgtatctcca 


gcctgctggc 


ccacactata 


aattttggac 


16380 


ttgccagcca 


aaatttacag 


agtacatgta 


cagatacaga 


gtaaatcaaa 


acgactgagg 


16440 


caagtctaaa 


tcattttaag 


atgtttgttt 


gccaaggtta 


aggacatgcc 


tgagaaagaa 


16500 


cccagaacca 


cagggaaaat 


tgtgacccat 


actttttctg 


aagagggtct 


ggagacctca 


16560 


gtattcaaag 


gcaaaagcgc 


aggtatgagg 


aaagaggaag 


aaatttttta 


aagggtgtga 


16620 


ctagataaga 


ggcaagcagt 


tgcatcctct 


tgagtgtttc 


atcagccatt 


ccctgaatac 


16680 


acaatgtacg 


catgacaggg 


gggtacagga 


atagttactt 


atacatttgt 


ctagtccaat 


16740 


aaatctgcct 


tttcacataa 


gataaaataa 


acatagggca 


gaggaagcaa 


tcagatatgc 


16800 


atttgtctca 


ggtgagcaga 


gagatgactg 


agttctgtcc 


ttcgtcccac 


acctgtgaag 


16860 


ataaggtcct 


aagatttatt 


ttccttttac 


taacatatat 


attatacaca 


catatatgta 


16920 


tatataaaat 


atatagggtc 


tgtttctctg 


gagaactctg 


aataatatac 


ttttacataa 


16980 


tctcaatacc 


agaacagaaa 


cccatatatt 


ccctatgaac 


atgaatttta 


aggtaatctg 


17040 


attgaagtag 


ctttaattca 


cttaaaacta 


cctaaataga 


tcagaaagaa 


agaatgggca 


17100 


ccaggggata 


aagtagttgc 


agggcagatg 


atgttgggtt 


ttaatctcca 


atacggcccc 


17160 


ttcggtggtt 


gcacacatcc 


tagtcactgg 


tgtgcgcagg 


tggctctact 


ccggctggaa 


17220 


aataaagccc 


agctccaagc 


aggccccaga 


gaaccacttc 


acataagagc 


tatagaaaac 


17280 


caggagtcac 


tagagggcct 


gcttgcctgc 


ttttttgaac 


aaggtctaga 


gctgtgatcc 


17340 


agattggtta 


caaaggtcag 


cactggttgc 


aggagcaggc 


agagcagaga 


gaaactgagg 


17400 


tccaagtcac 


aaacaagaga 


ggatgaggga 


agtaaggcaa 


gatcaaaagt 


tcctatttgg 


17460 


gattcacagt 


ggagaacaag 


agctgagtct 


acaggctgga 


gctggcctca 


gagaaccaac 


17520 
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catgggacgg agacaaatat caaggagaac 
ttttgttgtg tgtcctgggc caaacaaata 

5 

ctagaaaaat aaatattagt tgtctctaaa 
tgtttaaata catgaggaga ccgcaccacc 
10 tgctttctct ctgacacaca attctttcac 

acattctcct aacctctcag tcacccagac 
taataatgtt ggctctctct tctcacaggc 

15 

tgcactggga actctctgaa ttcattggat 
tcctatccct ataattgctg tagtaattta 
20 accatggtat tattgctttc aattcattta 
atgggctaaa gacccaatct ctaacttaaa 
gaaatgcctt atg'tttgttg tatattttaa 

25 

tggaaaatat attaaaggag gatgcttgaa 
ctacaaggct tatcttttgc agaatgaaaa 
30 tacagaggca atttttagag agcatgtata 
cacaggagga atttggggaa attaaacaga 
tattataata attccatggt agcagttgcc 

35 

agaagcagaa agtttaccca aggttaaatt 
tacccacaac tttgtacttt tcaaccaaaa 
40 taacacattc ccttttgctt tgtgaaaagg 

cccagagaag ggcctggtga tctctgctct 
tcttatagtg ggagtcctca agtggaagac 

45 

agtgtatgta ataaagacaa atcatttaaa 
aatcatctag aaagaatcct cagcccaaga 
50 gccatagata ataattactc aattttatac 
ctccaagttc attttacctt aattgccacg 
ccatttgaca aacagggaaa ctgaggcttg 

55 

cagcaatgaa gtgtcagagg taaaaacaaa 
tgacaggatt tcattttcct gggctgaagg 
60 tcatcttttg ttattgccgc aggcagattg 



cttaaggatc 


ctcagagcca 


acagctctcc 


17580 


ggcctctgct 


ttttacttaa 


ctagaaccaa 


17640 


ggttgctatg 


aaacaaaagc 


aaatactccc 


17700 


taactgtggc 


tgccatttct 


ctattcagcc 


17760 


tactgcttca 


gaagtaccag 


ggtaaagcaa 


17820 


tgtaatcatc 


tgtggtagat 


ttgacttgag 


17880 


tgtgcaaata 


atgattggcc 


tgattcacag 


17940 


tttgctaaga 


gaggattcaa 


gtggttctgg 


18000 


ctcatttttg 


tcattatttt 


ttgtgagtaa 


18060 


tttttttcta 


ttgaattcag 


gtgtctagaa 


18 12 0 


ggagcccatt 


tttgtgtgaa 


tacagactta 


18180 


taattttctt 


gagaggttga 


ggtgcctata 


18240 


gaataataaa 


gagagtggct 


atttgggtaa 


18300 


atccaaaggc 


cttagcttaa 


tgtctggggc 


18360 


cagaaaaaaa 


aaaaaaagga 


gacctagagg 


18420 


gatttctaaa 


gtagatgttc 


tgaataaggg 


18480 


aaaactccca 


ccaagggcat 


gcccaattac 


18540 


tctccataga 


agagagtttt 


attacctcct 


18600 


acagggcttc 


aagggagaga 


agatttagta 


18660 


cctctgagac 


atccctgtac 


agtcatttca 


18720 


ggataaaaga 


cattattttc 


tctaaggctt 


18780 


atttacaaac 


accgagaact 


ttctggaaag 


18 84 0 


ctgaaataat 


acaattaatc 


acttattgca 


18900 


tatgaaagag 


gagcatatga 


cagatattta 


18960 


ctattatgta 


tcaggcatct 


ccaaaaggca 


19020 


acaaccccat 


gagatagcta 


tcattcatct 


19080 


gagaacttaa 


gtgactctca 


cagagtgaca 


19140 


caaacacaag 


ctaagatgat 


tctgccttca 


19200 


atccccagtc 


cccttcttag 


tatgtttgaa 


19260 


aagacaggta 


tatgatttac 


tggaaaaaaa 


19320 
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ttaattcttg 
cacttagggt 
5 aaactattcc 
agtgagagcc 
aataaatgac 

10 

gctccctaag 
ttttcatccc 
15 ctacagactt 
gccactttgg 
cataaggcat 

20 

cagtagtgta 
ctctaaagat 
25 aacatttttg 
ttgtaaagat 
agcaataata 

30 

ctatctcaac 
aggctggtgt 
35 atcctccagc 
tttttcattt 
tggcctcaag 

40 

actaaaccta 
aaatgtaatt 
45 aataatgtgt 
cacaaaccag 
tccctcctgt 

50 

ctgaattttc 
actctgtgcc 
55 caaagggttg 
cacagggtgg 
ctgaagtcgc 

60 

atacaaaaat 



gtattccaga aacaacaatc 
atgatatttg tggaaggcaa 
cactgatgaa atttttttta 
accacactag ccactaggaa 
aattaaaaca agtgaagtgc 
tcccatcacc tatgcaaagg 
tataacagag aaatagtgaa 
cgaatggggt ttttcatgtt 
atattaggaa tgccactggc 
tgtgtgttta taaatgctta 
atacagaagc ctgttacgag 
taaaatgtta cttggtgcaa 
gggaaaaaaa acttttccaa 
atcctccagt gcatcctctg 
ttgtatacag agacagacgt 
gagcatatat aaccaaataa 
gcagtgggtg atcacagctc 
ctctgcctcc caaatagctg 
tttgcagaga taagggtctt 
gcatactcct gccttggtct 
gccaccaaac aaaacttcga 
aggataagat tagaaaatta 
gaaaagagaa ttgccttcag 
atatcatgac aacacatcta 
ctctcacagc cccccttgtg 
caattctgac tgccttacta 
tcactctcct catttaaaaa 
ctgtgaggat taaattagta 
ctcatgccta taatctcagc 
gtgttcgaga ccagcctggt 
tagctgggtg tgatggcgca 



ctttcaatta gtgccagaag 
cctataaatc ttcagatgta 
taacattatg ggagaaacgt 
atattttcta ataattcatt 
ctacaacttg tcctcataaa 
tctcattaat gaccggagac 
tccatgcttt agaaattgta 
aaaatcactt tcttcagaaa 
tctagggaaa gaagctattg 
tttctatccc cagttcatca 
atataaggta tgtcagaata 
acaaacaaaa aaagaaatta 
atgtttaaat gaaaaaaaat 
tgcatgaatg ttggaggagg 
gagagtttcc atgaaataaa 
aacttaagac aaggccttac 
aatgcaacct ggagctcctg 
ggactgcagg tgtgtgccac 
gctatgtttc ccaggctggt 
ccgaaagtgc agggattata 
ccctgagaat caaagcattg 
gtgtctagtt aaggagaatg 
caaaaccagg tcaggaaata 
acgaaggatg gtcctttttc 
attaggatca cagctctgga 
gtgcctgtgt aaccctgagc 
ataaaatgtg gttaataata 
tgtaagcata aagaacagta 
cctttgggaa gccaaaggca 
caacatggca aagccccgtc 
cgtctgtaat cccagctact 
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gtgatgatag 193 8 0 

atgttttctg 19440 

agttcttgca 19500 

tcaagggatc 19560 

atcattgtga 1962 0 

tttaaaacat 19680 

ctagcggtca 19740 

tgcattcagt 198 00 

aggggat t at 198 60 

actcaggatc 19920 

ttcggagcaa 19980 

ttgctgggcc 2 0 040 

tatttaattt 20100 

atacagtgtg 20160 

tcatttcaag 20220 

tctgtcaccc 20280 

ggctcaagtg 20340 

cacacctggt 20400 

tttggactcc 20460 

gacatgagcc 2 0 520 

ggaagcgaag 2 058 0 

gataagacaa 20640 

tcaattagaa 2070 0 

ttttagaact 20760 

ggcagatagc 2082 0 

aaattacttt 20880 

gtaactgcct 20940 

cctggggcca 2100 0 

ggtggatcag 2106 0 

tctactaaaa 2112 0 

caggaggc tg 2118 0 
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aggcacgaga atcgcttgaa cctgggaggc 
tgctctccag cctgagtgac agagcaagtc 

5 

cacaaagata ccactctgct tatcaaaacc 
ttccccaggg gaaaatctgc tttcagggac 
10 atatttcaga t-aaaggggtc ctcctatttt 

tgagaggcca tttttctcag ccccagattg 
cacttttttt aacttctatt cccaaggtag 

15 

ctttttctaa gctttttttt tttttttttt 
tgtgacttag aagtcacctg gtgttgcttt 
20 taagtacaga tgacatgaaa gcccataaag 

tagcattctc atagtttagt ttctgtattg 
acccaaagaa acagttgtga ataaactcag 

25 

aaaagttttg ggttttttta ctcgaggatt 
agttccagtc acaacctaga tccctttgta 
30 tctgcccttc tttttccatc tccccactcc 
atatgtgtgt gtgtgtgtgt gtgtgtgtgt 
gtcaggcatg ttctatgtat gtaggatgca 

35 

ccttagggag cttacattct aatagcagga 
gggaaataat ataatatgtt agaaggtgat 
40 gggctgccat agtttcgtct tgaacatttg 

agatatttaa agccaggaga cttgttgaga 
aaagagaaac aagaactgag ccctagggca 

45 

aaaataataa tttttttact tgaaaagttt 
tattatatct tattcgtgtg catgttttca 
50 tgtttttgtt tgaatttttt tttttgtttt 

tcttgctctg ttacccaggc tggagtgcag 
cctcccaggt acaagcaatc ctcccaactc 

55 

ccaccaccac gccctgctaa tttttgtatt 
ccaggctggt ctcaaactcc tgacctcaag 
60 tggaattaca gtcattagcc accgtgcccc 
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aaaggttgcc 


gtgagctgag 


atcctgccac 


2 124 0 


tccatctcaa 


gaaaaaagaa 


cagtaactgg 


o i a a 


.cgctgagaaa 


aaacaggtgg 


attctcctct 


213 60 


tggtaagaga 


agaaattgaa 


attgtatttc 


2142 0 


ttctttcagt 


agaatagggt 


cagagtggta 


21480 


gaagctataa 


ttatgggcat 


taagctatta 


2154 0 


ccactgttaa 


tatcattttc 


cat tctaggc 


2160 0 


tttgtaattt 


gtttgcaagc 


agagtgaaca 


21660 


tccacggaaa 


accttacatc 


cacagagata 


2172 0 


cccccatgtg 


atttggtaaa 


ctcccagaac 


2178 0 


gtgcatactg 


tcccaaattt 


tctccacaac 


21840 


ccccttgact 


gattttcaca 


aaaactcaca 


21900 


ggtgggggaa 


gattatgatt 


gtaatttcca 


21960 


actctctccc 


taatccccat 


cactccgtct 


22 02 0 


acctcaatca 


atcaatctat 


acatatacat 


2208 0 


gtagatatat 


atatatataa 


tttgctgcat 


2214 0 


tcagtgaaca 


aacagatgaa 


aatttattac 


T O O A A 


taaaagcaat 


aacaataaac 


aggacaaaca 




aggtgtgatg 


aaaaaaagaa 


aacgtaataa 


22320 


gacggattca 


gttgagaatc 


ttagacatga 


22380 


tctccagtgg 


aaagaacaca 


acaaagacga 


2244 0 


ctccaacatg 


aaaaatagaa 


aaaaggggag 


22500 


cttggtttaa 


tcatgttcta 


aagagtaaaa 


2256 0 


agttaaagca 


gaaaataaat 


gtagaatata 


22 62 0 


ttgcttgttt 


gtt tgttttt 


tgagacagaa 


22 68 0 


tggtgcagtc 


tcagctcact 


gcaacctctg 


22 74 0 


agcctcccaa 


gtagctggga 


ctacaggctc 


2280 0 


tttagtggag 


acggtgtttc 


atcatattgg 


22860 


tgatctaccc 


acttcagcct 


cccaaaatgc 


22920 


accagaatat 


atgtcttttg 


taccataatt 


22980 
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tttaaattgt 


tttttcaaac 


aatatacatt 


aagtgacttc 


cttttggggt 


gcagatattt 


23040 




gagttcagga 


acttttttct 


gaaaacatat 


atatttattt 


tcatgtaaac 


ctgggaaaat 


23100 


5 


ctttagctgt 


ctatgtaagt 


attatcttaa 


ttgattcttt 


tcttaagcat 


acaactcaat 


23160 




agtttttagt 


atattcactg 


atatgtgcaa 


ttattaacac 


agtccattca 


aaaacatttt 


23220 


10 


catcacctca 


aaaagcaact 


ctgtacccct 


tggctatcac 


tcccatcccc 


aacttcccat 


23280 


tctccccaac 


cttaggcagc 


cactaatcta 


catagggtct 


ccatatattt 


tcctactcca 


23340 




gacatttcat 


taaaaatgaa 


tcacatcata 


tgtaatttct 


tgttaactgg 


cttatttcac 


23400 


15 
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ttotaacata 


tatcaatact 


ttattctttt 


23460 




tatgactaaa 


catttctata 


caccataaca 


ttcaagctga 


gagtcaaatc 


aagaatgcaa 


23520 




ttgcatttac 


aatagccaca 


aaagtaataa 


aatacctaga 


aatacatcta 


accaaggagg 


23580 


20 


















tgaaagatct 


ctacaaggaa 


aactagttaa 


tacaaaatat 


tgcttaaaga 


aatcatatat 


23640 




gacacaaaca 


aatggaaaaa 


cattttatac 


tcatggatta 


gaagaatcaa 


aatcattaaa 


23700 


25 


atggccatac 


tgcgcaaagc 


aagctacaga 


ttcaatgcta 


tttctatcaa 


accaccaatt 


23760 




atcattttta 


gcacaattag 


aaaaagtatt 


tctaaaattc 


atatggaatc 


aaaaaagaac 


23820 


30 


ccaaatagca 


aaagcaatct 


taatcaaaag 


gaataaagcc 


agaagcatcc 


acattacctg 


23880 


acttcaaagt 


atactacaag 


ggtatagtaa 


ctaaccagta 


ccatgctggt 


actggtgcaa 


23940 




aaacaggcac 


atagatcagt 


ggaaccaaat 


agagaatttg 


gaaatacatc 


tgcacaccca 


24000 


35 


aaaccaactt 


atattcaaca 


aagtcaacaa 


aaataagcaa 


ttgggtaagg 


attccccatt 


24060 




cagtaatggt 


gctgggataa 


gtggccatcc 


atatacagaa 


taataaaacc 


gaaccctcaa 


24120 


40 


aagcaattgc 


aacaaaaaca 


aaaattgaca 


aatgggacct 


aattaaacta 


aagactttct 


24180 


gcacagcaaa 


aaaaagaaac 


tattaaacta 


aacagacaac 


ctgcagaatg 


ggagaaaata 


24240 




tttgcaaact 


atacgtccta 


caaaggacaa 


atactcagaa 


tctataagta 


acttaaacaa 


24300 


45 


ttcaacaagc 


aaaaaacaaa 


taaccaaatt 


aaaaagtggg 


caaatgacat 


gaacagtcac 


24360 




ttcttaaaaa 


gaagacatac 


aagcagccaa 


caaacatgaa 


aaatgcttaa 


tatcactaat 


24420 


50 


tgtcaggaaa 


atgtaaacca 


aaaccaaatg 


agataacatc 


tcataccagt 


cagaatggct 


24480 


attactaaaa 


agtcaaaaaa 


tagatattgg 


tgagaatgcg 


gacaaaagga 


acacttatac 


24540 




actgctggtg 


gaaaagtaaa 


ttagctcagt 


ccctatggaa 


agcatctgga 


aatttctaaa 


24600 


55 


gaactaaaaa 


taaaactacc 


acttgaccca 


gcaatcccag 


tgcgcgtgtg 


cgccccnggg 


24660 
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2472 0 


60 


caaagacacg 


cgctcactcc 


cggtgcccct 


ctctggcgat 


atgggtatag 


aaaaagagat 


24780 


atatctctat 


atatacatat 


atatatatat 


atatatatat 


atatatatat 


atatatatat 


24840 
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atatgcatac catggaatac tacacagcca taaaaaagag tgaaatcatg tcctttgcag 24900 

caatatggat gcagctggag gccattgtcc taggtgaatt aatgcagaaa cagaaaacca 24960 

5 

aatgccagtg gggttataat aagtgggagc aaaacgttgg gtacatatgc acataaagat 2502 0 

aagaaaatag acattgatga atgctactgg gggaaggcag gagggaagaa agttttaaaa 25080 

10 actacctctt gggtactaac tgggtgatgg gatcaatcat accccaaacc tcagcattat 25140 

aaaatacacc caaataacaa aactgtacat atgccccctg aatctaaaat gaaagttgaa 25200 

ttttttttaa agaacgtgaa tagatattgt tttatttcta tagtttattc agaaataaat 25260 

15 

ctgcacaccc acaaccaact ggtattcaac aaagtcaaca aaaataagca atggggtaag 2532 0 

gagtccctat tcaataatgg tgctgggatc attggccatc catacacaga ataataaaac 25380 

20 tggaccctct gaatgccttt tgggttttct ttactgccga actgccctgg ttagaatttc 25440 

taatacaagg ttgaatagat ataatgagag tatatctcac cgtcttattc ctcatgaaag 25500 

cattgagtct ttcaccaata ggtgtgttag gtgtggattt ttattagatg ctccttatca 25560 

25 

ggttgagaaa gatcccatct attttgagta ttttgagtgt ttttatcata aaacagtgaa 2562 0 

gaattttcac caaatgcttt gtctacatca attgagatga tcatatgtgt gtttttctat 25680 

30 tttactgata tggcatatga cattaattgt ataagataaa accagccttg catttgtgga 25740 

ataaatcaca cttgcttaca gtgtacaagc atacctcgtt ttattgtgct ttgctatatt 25800 

gcacctcaca gataatgagg ttttttacaa agtgaaggtt tataataaca ctgcatcaag 25860 

35 

caaagctgtt tcttcaacat cacgtgctca cttcatgtct ccgtgtcacc ttttagtaat 2592 0 

tctcacaata tttcaaactt ttttatgatt attatatttg tcatagtgat ctgtgatcag 25980 

40 tgacctttat gattattata tttgtcatag tgatctgtga ttggtgacct tcgatattaa 26040 

cattgttatt gttttggcat accataaact gtgtctatat aagacaacaa atttaattga 26100 

taaatatgtg cgttctgact gctctaccaa ccacacttac ccttgtctct ctccatctcc 26160 

45 

ttgggcttct ctattttcta atatacaaca acattgaaat taggccaatt aattatattt 26220 

gtcatagtga tctgtgatca gtgaccttta tgattattat atttgtcata gtgatctgtg 26280 

50 atcggtgacc tttgatatta acattgttat tgttttggca taccataaac tgtgtctata 26340 

taagataaca aatttaattg ataaatatgt gtgttctgac tgctctacca accatcctta 26400 

cccctgtctc tctccatctc cttgggcttc tctattttct gagatacaac aacattgaaa 2 6460 

55 

ttaggccaat taataaccat acaatggcct ctaaatttta agtgaaaaga agagccacac 2652 0 

atctctctct taagtcaaaa gttataaatg attaagctta tgaggaaggc atatagaaag 26580 

60 gtgagatagg ccaaagttaa tcctcttgca cccaaccatt aacataggtg tgaatgtaaa 26640 
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ggaaaaattc ttgaaggaaa ttaaaagtgc 
acaaaacaag cttgttgctg atgtggagaa 
5 ctacagcatt tccttaagcc aaagcctaat 

attctgtgaa gaccgagaga gatgaagaag 
ttgattcatg aggtttaagg aaagaagttg 

10 

agcaagtgat aatgtaggag ctgcagcaag 
tgaaggtggc tgtattaaac aacagatttt 
15 gaagacgtct tataggactt tcccagctag 
acaggctgac tctcttgtta gaggctaatg 
tcatttacta tactgaaaat tctagggccc 

20 

gtaaatggca taagaaagca tggataacag 
attttaaatt caatattaag aactactact 
25 ctgctcattg acagtgcacc tggtcacatg 
tttgcagtga tgaatcaagg agtaatttgt 
ttttgtaaga ctacagctgt catagctagt 

30 

ttgaagagct tctggaagat tcaccattct 
ggaggaggtc aagatagcaa cattagcagg 
35 gaataacttt gaggggttca agacctcggc 
agcaagaagg gaagcctgaa gatgtgactg 
tggataagga gttgcttctt atggatgagt 

40 

ctccagtgaa gatgctatga acattgctga 
aagcttagtt gataaagcac tgtcaaggtt 
45 ttgtatggtg gataaaatgc tatcaaaaaa 

aaagaagagt tgatgggtgc agcaaacttc 
ggaccccaac cttcagtaaa caccacccta 

50 

gaccctccac cagcaaaaac attacagctc 
tagcaataaa atattcttaa attaagtaca 
55 acacttaata gacgacaata aactgtaaac 
agtctctctc tgtcccccag gctggagtgc 
cgcctcctgg gttcatgcca ttctcctgcc 

60 

acctgccacc acgcccggct aatttttttg 



PCT/US01/48437 



tgcttgagtt 


aacacacaag 


tgataagaaa 


26700 


agttttagtt 


gtctggacag 


atcaaaccaa 


26760 


ccctagcaag 


gccctaattg 


gctcttttca 


26820 


cttcagaaga 


agagtttgaa 


gctagcaagg 


26880 


tctccataac 


ataaaaatac 


aaggtgaaac 


26940 


ttatccagaa 


gatctagcta 


aaataattga 


27000 


caatgtagat 


aaaacagcct 


tatattggaa 


27060 


agacaatgct 


tgggctcaga 


gcttcaaagg 


27120 


cagctgatga 


ctgtaagttg 


aagctaatgc 


27180 


ttaagaatta 


atctactctg 


cttatggtct 


27240 


cacatctgtt 


tacagcatga 


tttacagaat 


27300 


caggaaaaaa 


agcttccttt 


taaaatatta 


27360 


tgttttcatg 


tctgttaaca 


taacatccaa 


27420 


actttcaagt 


cttgttattt 


aagaaataca 


27480 


gattcctctg 


atagatctcg 


gcaaagtaaa 


27540 


agatgacatt 


aagagcatta 


atgacataca 


27600 


actttggaaa 


aagttgatac 


tacctctcat 


27660 


ggaggaaatc 


actgcagatg 


ggatggaaat 


27720 


aattgctgta 


atctcctgat 


aaaactttaa 


27780 


aaagaaagtg 


ctttcttcag 


gtggattcta 


27840 


aatgacaatg 


aagaattcag 


aatattacat 


27900 


ggagaggatt 


gactccaatt 


ttggaagaag 


27960 


cattgcatac 


aacagaggaa 


acttttggga 


28020 


atgttatcta 


atcttaagaa 


attgccacag 


28080 


atcagtcagc 


agccaacaac 


atcaaggcaa 


28140 


actgaggctt 


aggtgatcat 


tagctttttt 


28200 


tttttagaca 


taaagctatt 


gcacacttgc 


28260 


agtagcataa 


cttttttttt 


tttaaggcgg 


28320 


agtggcacaa 


tctcggctca 


ctgcaagctc 


28380 


tcagcctccc 


aagtagctgg 


gattacaggc 


28440 


tatttttagt 


agagatgggg 


tttcgccatg 


28500 
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ttagccagga tggtcttgat ctcctgacct 

gctgggatta caggtgtgag ccactgcgcc 

5 ^ 

aactaaaaaa ttcatctgtt acgttattaa 

tgaacctgca atatctccaa agtatgcctg 

10 gtttgctaac attttgttga agatttttgt 

gttttctt'tt gttgtgatgc attagtctga 

gaacgagtta ggaagtgttc cctcctcttc 

15 

ttctttaaat atttggtaga agtcaccact 

agaagatctg attataaatt taaatacctt 

20 ttcttcctta gttttagtag tttgcatgtt 

ctaatttgtt ggcatataat tgttcatagt 

ggtttgtagt aatgtttcca cattcatttc 

25 

ttctttagtc tcattaaagg cttttccatt 

gttcattgat tttatctact gtttttctat 

30 ttactacttt atttccttct gctttcttca 

atcttaagat ggaaagttag gttattgatt 

tttacagcta taaatttcta tctaagcact 

35 

gctgtatctt cattttcatt catcgcaaag 

taatcccttg gttatttagg agtatgttgt 

40 ttgtgttatt aattttaatt ttgttccatt 

gccttttaaa tttattaaaa tgtgttatgc 

ccatgtgcag ttcagaaaac cgtgtattct 

45 

tctgttaggt ttagttggtt tataatgttg 

tgccttgttg ctctactcat cattaaaagt 

50 aaattgtctg tttctttctt tgtttctgtc 

ttcttagaaa atatatgttt ataaccatta 

tttttttttt ttgagacgga gtctcactct 

55 

ctctgctcac tgccagttcc gcctcctggg 

agtagctggg accacaggcg cccaccacca 

60 agatggggtt tcaccatgtt agccaggatg 
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tgtgatccac ctgcctcggc ctcccaaagt 28560 

cagccagcat aacttttata tgcattggga 28620 

agtattcatt ttattgcagt ggcctgaatc 2 8 680 

catatttctt tttctgtgtt gctgtattga 28740 

gtccatgttc ataagagatc atggttttta 28800 

ttttgatctc agggtaatag tggcctcaca 28860 

tattcttaga agagtttgtg aagaaggtaa 28920 

gaagtaatct gggtctgcac ttttcttcat 28980 

aattctttat agtctattca gattttctat 29040 

tctaggaact tgtccatttc agcaggttat 29100 

attctttcat aatcacattt atttctataa 29160 

tgattgtact aatttgagac ttctctattt 29220 

ttgttcatct ttcaaagaat caacttttgt 29280 

tttgcttcta aataattttc actttaatct 29340 

ggtttgatat gggttttttt tattttcatt 29400 

tggggtcttt cttctttgtt agtataggca 29460 

aatttagctg catcctataa gcattgatat 29520 

tattttctaa ttttcttttt atttcttctt 29580 

taatttccac atatttgggg tttcccaaat 2 964 0 

gtacttagag aaaatacttt actatttcta 29700 

cctaaactct ggtctattcc aaagaatgtc 29760 

tctgttgttg ggtgaaatgt tctatagatg 29820 

ttcaaatctt ctatttcctt gtttatcttc 29880 

gggtcattga agtccctaac tattattgtt 29940 

tggttctgct ttgcgtgttt caagcctctg 30000 

taaaatgttc ctctctatct ctagtagcat 30060 

gttgcccagg ttggagtgca gtggccccat 30120 

ttcacaccat tctcctgcca cagcctcccg 30180 

cgcccagcta attttttgta tttttagtag 3 024 0 

gtctccatct cctgacctcg tgatctgccc 3 0300 
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gcctcggcct cccaaagtgc tgggattaca 
ttaaaatata ttttgtctgg tattagtata 
5 ggattacaaa tatttttcta tttttttgtt 

tgtgtctccc atagacagca catagttgga 
tgcctttaat tgaattgttt tattaattca 

10 

tgcatttgac attttacttt ttctccatgt 
gactgcttta ttttttggct aagtggatat 
15 ttttttactg tatatttcag ttatttcatt 

acttatcaaa atctgcttca cat t tat act 
atccctgtat agctctattt caatacctct 

20 

tgaatgttac aaacataaga taacattatg 
aaagaagcaa agagaagaaa aggaacaagt 
25 tatctattta ccattacaat ttattattta 

ctcctcattt gttcctgtgg gttcaaattt 
ttccttagct taaaatacat ttgctcctct 

30 

attgcatttc cctatgttat agacccaaca 
tgctttttaa ataagttagt agaagaaaga 
35 atttgtataa ttttcttttg tcagtgatct 
tgtagtcact tgcttttggt ctgaagaatt 
caacaaattc tttcagattt tgttcatctg 

40 

aagacagctt tgctagaagt aggattcttg 
ttgaatttgt tatcccacta ccttctgggt 
45 aatcttattg gagttccctt gtatgtctca 

ttctctttgt tggctttcag cgtttttact 
agtttcttcc ttgggatttg ttgagattcc 

50 

atatggaaaa ttttcaggca ttatttcttt 
cccctttctg gtactttcat tatgtatatg 
55 taagtctcta ttcatatttc ttcattcact 

ctatcaatct atcttgaaat tcactaattc 
gccactctaa cgaatttttt attttagctg 

60 

ggtgttttaa aaatgattta tatatcttta 
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ggcatgagcc 


accatgcccg 


gccaacattt 


30360 


gccattctag 


ctttcttgtg 


tttgttcttt 


30420 


ttcattcttt 


ttgtgtcttt 


gaatctaaag 


30480 


tctggttttt 


gtatccaatc 


tgacaatctc 


30540 


catttaatct 


tatattgaaa 


tagttaggct 


30600 


tttatgtttt 


ttgtttctct 


attctgcctt 


30660 


tttctaatgt 


agtattttaa 


tttttaataa 


30720 


aattgtcgct 


ctaaggctca 


tcatatttta 


30780 


aatttaattc 


cagtgaaata 


tagaaacatt 


30840 


ttatgtgata 


ttattgatgt 


attacatcta 


30900 


attattactt 


tatataattc 


tatctctttt 


30960 


atacatttac 


agtttttgtt 


atttaccatt 


31020 


ccagaaattt 


tttatttata 


atttcttgtt 


31080 


aacatctaga 


gttaaactcc 


tagatgttga 


31140 


ccccccactt 


ttgctgttat 


tggcaaatat 


31200 


atacgttaca 


tacatattgt 


ttcatatgat 


31260 


ggataaattt 


gcatttacac 


tttcttttat 


31320 


ttatattttc 


atgagattca 


aattactgtc 


31380 


ttctttagta 


ttttttgtaa 


ggcatgctag 


31440 


caaatgtctt 


tattttgttt 


tcatttttga 


31500 


gttgataggt 


cttttttttc 


tttgaaccct 


31560 


ttcaatgttt 


ctgattagaa 


gtcagctgtt 


31620 


agtcattttt 


ctcttgctgc 


tttcgagatt 


31680 


gtgatatgtc 


tacttgtgga 


tttctttgtg 


31740 


taggcggtat 


agattaatgg 


ctttgaataa 


31800 


gaatatattt 


tctgcttctt 


tctctttctc 


31860 


ttgtttcctt 


aatggtattt 


cacatttctc 


31920 


tttctctctg 


tctttcagat 


tgcataatct 


31980 


tttcttctgc 


cacttcaaat 


ctactgttgt 


32040 


ttataatttt 


caactccaga 


atttccattt 


32100 


ctgatattct 


ctatttgatg 


caatattgtt 


32160 



-293- 



WO 02/062946 

atcacacctt cctttacttc cttaatcgtg 
tgactatttt gaaacatttg tttattaaat 

5 

cctacagttt atatcatcta tgggtcaaac 
tgcctattat ataattgttt tgctggaaac 
10 tctgggtact catcccccac accaggactt 

tctaggttgt ttcagtgaag tctatttctc 
cttatagggt acaggcttgt gatgcccaca 

15 

ggctctttgt ctctttccct aaccataccc 
tgttctgagg caaaaactgc ttcacagact 
20 gatagctccc aaaatcagtg tttgagattt 
tccccttttc ccatttctct ctgggaaact 
atgtatctgt gaacacctac caagtgcctt 

25 

cttgtaagcc taaacttctc cacatgctac 
ggagatttct attccatagc ctatttctct 
30 aggggtgagg aagtgggtag ggacaacgga 
gaatggggga ggggacagaa gccccagttc 
cttaccccaa agccagaaaa aggtcaatta 

35 

gagtaaaagc tgggtagaag agatcaccca 
ctcagtgtta gtatttgggc aggaccagaa 
40 aagatagtat agcactctga ctgggatctt 

ttccaatcta gagtggtctt gctgatctga 
cagataccat accagaaact cttacagcat 

45 

cctttatttg ctatattcca ttaaaaaggt 
gatttttgaa ataattttta caaatttggc 
50 atgctggaag tgggactctc caaaaataat 

tattagaatt tttaggaaca tagatacctt 
atcctcatta atggacatag gataagtgtg 

55 

ctttgtacta aggatgaatc tttttagttg 
acaaagctaa aattgaaaat cctaatcaca 
60 catgtattaa atgtattatt tatacatgac 
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gcttacttaa 


gtgctcaaac 


atctttataa 


32220 


attatatctg 


gtctctcaca 


gtttttgttg 


32280 


ttgcctgttt 


cattgtctgt 


tttatattgt 


32340 


tgggcatttt 


cgataatata 


ttatagcaac 


32400 


attgttattt 


acttgattag 


aaaaaaactg 


32460 


cgtcctcact 


gttaagcttt 


gatgttgccc 


32520 


gtcaccctgg 


gatgacattt 


atcttggaaa 


32580 


agctgttaag 


ctctgctagt 


gcttgctggt 


32640 


aacccaaaca 


aatttgagct 


cctttgaggg 


32700 


gttctgactc 


caaaagggct 


cttcctagct 


32760 


cgtgggccta 


cagtttagcc 


tgtatctaaa 


32820 


ttaccacaac 


ttccactgtt 


tttgagagta 


32880 


aaatgaagtc 


acctcctttg 


ggaacagagt 


32940 


cccaacatct 


ctgagctaag 


gatctggtgc 


33000 


gtctttctct 


ttaagagttg 


agggtttggt 


33060 


ctctcaactt 


gcctctcatg 


atgtgaaacc 


33120 


gagcccagta 


ttatagagtc 


ttcatcccat 


33180 


tctcttggct 


gccttcactt 


ggtacttacc 


33240 


aggctaatta 


gatggcctac 


ccttcctgag 


33300 


tgagaaaagg 


gagccctgtg 


tttttggcta 


33360 


gaggagggag 


ggagggtgca 


gatcttggtt 


33420 


ttactgagta 


gattttcttg 


aataaatgtt 


33480 


ttacagagta 


cagaggtttt 


aaaggattaa 


33540 


tggggagctg 


gttgagttcc 


tcccattctt 


33600 


aatgctgtac 


acttaattct 


ataaataatt 


33660 


cttaagtgat 


cctggttcat 


aagcctctct 


33720 


cagaggcatt 


ctaagcatac 


cacaatctga 


33780 


acaactccag 


cagcatagtt 


ctacatattc 


33840 


atatactaga 


ttattcctcc 


tttagtatgc 


33900 


taatacatat 


atgtattaat 


acatgacatg 


33960 
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tattaaatgt attaaatgta ttaaatttta ttaaatgaaa ttgtgcttgt aattatatct 3402 0 

cccccattag attaaaagct ccttgagtag aaacctgagg tttagtacat tgtaggcatt 34080 

gtataattaa tgaataaata aatattccct ctgaaaaaaa aattatccta ctagcttcta 34140 

gacaaatttg aggtggactc ctgatatcaa cctctgtgac aatcaaagaa agtcagattt 34200 

ttaaaacttt aacttgaatg ttctagccag taggacagac tgattgcttc agcaacatct 34260 

gtttatgctg tccataacca atttttcttt gtatgtctat tacaccttac tttagcaatt 34320 

gtaatgaaca gcataactat ttgtgtgact acacttggta taattctatt aagcatggag 34380 

15 attgcaactt ttgcattaaa tgcaaccaca tacaattggt caaacgtaag taccaatttt 3444 0 

aataataatt agatcaaaat tttaatataa atgaaaaatt atggtgaatg atgtatgttg 34500 

aagtgagtaa ggcatagaaa aggccaccaa atgtgtcaat tatctttggc atagaaatgt 34560 

20 

aaagataaac tgtattttct ctatattctt gctataagca cattgaaaaa taagtttttc 3462 0 

tcaaaaatcg tgatgcacta taaacctcaa agagagatta aaattatcgt taagataaaa 34680 

25 tgatagaata cttagaaacc ccaaagacct acctgaaaaa caactgcaga agtcaaacag 3474 0 

tttagagagg tggctggaca aaagaaaaac ctgcaaaaat taatagattg tctttacact 34800 

agaaataacc acttaaatat tgaaattttt aaaaaaatca aaataacagc aaaactataa 34860 

30 

atgtttaata agaagcattt taaaaggact gaatactttg aaaaattatg ttcttgaatg 34920 

cgaatttaca aaatcataaa attgcaagtt tctgcaaatt aacacataat tgtgatgcaa 34980 

35 ttccatttag aatctcaaca gcatttcgga agtcatgcaa taaaattact ctaaaatgca 3504 0 

tatagataga tgaataattg cataaaaatg ccaagaaagt ctagaagtta catagtgatt 35100 

gaggaggatc agataatttg ctgcactaga tattaaaact tgttattagg ctattataat 35160 

40 

aaaaatagta cagtatttac acactaaaag gaaaatttaa ctaagcacag aatagagagt 3522 0 

tgaaaaataa atctaagtat aagtgaacac atattatttt gtaggtgata acatgaccag 35280 

45 tcaatagaga aaagatgaaa ttggaaaaaa acagtattaa atcaataagc tattgatttc 35340 

caagaaaata ataatggaca cctactttgt accacaaata aaaataaatc acagatgaag 35400 

tatgtgtaca taaaaaccat taaaattagc ataaaatata gaactaatat attatttaca 35460 

50 

taagcaacag agaaaaccaa gaaggaggaa agatagaaag acattttatg tcacaaatat 3552 0 

ttgcactttt tatacaatac aggctgagac aacaatagcc taaagagtta aaatactgcc 3558 0 

55 ccagtgggct gagggagtgg ggtccccacc ccagggtgac aaagaatagg gctgtcccct 3564 0 

agctgactca gaagacagaa ccacagagga ttatcctcag gcctcaaaaa tctaacaaaa 35700 

ttttctctat tgggttttgg acttgcttag gtctgctgcc cacttctttc tttcttattt 35760 

tctctattgg aaatcgaaat gactatcctc tgccagtccc accattgtat tttagaagta 35820 



60 
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gattactagt 


tttctggctt 


cacaagtcca 


5 


aatcatgcta 


taagtgatga 


tttctagcat 


tgagatttga 


ggatttttga 


ttttataata 




ctagagtggg 


ttacaacttt 


cggagatgtt 


10 


aaggacatga 


attoaoQQQa 


ac tagaggca 




acatatatgt 


tgcagtccta 


acccatggca 


15 


attattccag 


gtataataag 


ctaaattaag 




aatccagtat 


gattggtatt 


cttataagag 




tggcactgtg^ 


tattagttca 


tttttacact 


20 


t t at aaagga 


aagaggtt ta 


attgactcac 




acttatagtc 


atgacagaag 


gcaaaggaga 


25 


gagagagcaa 


aaggggaaca 


gctacagact 




tcactaccat 


gaaacagcat 


gggggagcca 




ccctccctca 


acacgtgggg 


attacaattc 


30 


aaaccatat t 


attccatccc 


tggcccc tec 




cacaatcatg 


ccctcccagc 


agtcctccaa 


35 


agtctaagtt 


caaagtctca 


cctgagacaa 




caaaaacaag 


ttagttactt 


ccaaggtaca 




cattccaaat 


gggacaaatt 


ggecaaaaca 


40 


acccagtagg 


gctgttat ta 


aattctaaag 




tcacatctgg 


ggtacacaga 


tgcaagaggt 


45 


ctgtggctct 


gcagggtaaa 


gcctctgcag 




cagcttttcc 


aggtgcgttg 


tagaggctgt 




atggtggccc 


tcttctcacg 


ggtctagtgg 


50 


ctcaacccca 


catttcccct 


ctgcagtgcc 




cctgtcacag 


ccttcttcct 


ggacatccag 


55 


ggagcccacc 


aagcctcaac 


tcctgacctc 


agccaccaag 


gcttggggct 


tgctccctct 




cttttagcca 


cagctggagc 


tggtgtggct 


60 


acagagcagc 


agcaggggcc 


ctggacctgg 



cagatagtga 


ggaatttttg 


ccccaagatg 


35880 


gaatctagat 


gctataactg 


atttagatga 


35940 


tttaggaaaa 


gtttggacat 


agagttgatt 


36000 


gaaattagct 


gaatatattt 


tgcacatggg 


36060 


gactgttatg 


ggttgaattg 


tgataccgaa 


36120 


cctcagaatg 


tgacattttg 


ggggaatggg 


36180 


atgaggtcat 


acaggggtag 


gtaggtcatt 


36240 


agaaatttgg 


acataggtac 


ataaggaaga 


36300 


gctataaaga 


actgactgag 


acagagtaat 


36360 


agttttgcat 


ggctegggag 


gtctcacaaa 


36420 


agcaagtacc 


ttcttcacaa 


ggaggcagga 


36480 


tttaaaccac 


catatctcat 


gagaactcac 


36540 


ctcccatgat 


ccaattacct 


cccaccaggt 


36600 


gagatgagat 


ttgggtgggg 


acacagagcc 


36660 


caaacttcat 


ctcctgctca 


catttcaaaa 


36720 


agtcttaact 


cattccagca 


ttaattcaaa 


36780 


ggaaatttcc 


ttccactgag 


cctgtaaaat 


36840 


atgggggtac 


aagcattagg 


taaatgttcc 


36900 


aaggggecac 


aggtcccatg 


caagtctgaa 


36960 


cttcaaaatg 


atctcctttg 


gctccatgtc 


37020 


gggctcccaa 


ggccttgggc 


agctccaccc 


37080 


ctgttttcat 


aggctgatgc 


tgagtgcctg 


37140 


cagtggattt 


accattcttg 


ggtctggaga 


37200 


gcagtgcccc 


agtggggact 


ctgtgggggc 


37260 


ctagtagaag 


ttctccatga 


gggctccaac 


3732 0 


gcattttcat 


acatcctctg 


aaatctaggt 


37380 


ggtgcaccca 


caggcccaac 


accatgtgaa 


37440 


gaagcaacag 


cccaagctgt 


accttggccc 


37500 


gggaagcagg 


gccccaagtc 


ccgagactgc 


37560 


gccacaaaac 


cattttttcc 


tcctaggcct 


37620 
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ccaggcctgt 
cccattgtct 
5 cttgaattta 
tccaaatttc 
tgctcacaca 

10 

tgcttagaaa 
aaatctctag 
15 ctttgctcca 
cattgtccat 
cctaactttc 

20 

ctatttccca 
ctcctggtac 
25 actgggtaat 
gcctcaggaa 
ggaagcagga 

30 

atgagaactc 
acctcccacc 
35 gggacacaga 
attacaagtc 
cctatatgcc 

40 

cagacccata 
atattgccct 
45 atacttagtt 
aatatttact 
tgaagatttt 

50 

catcttagag 
tgaaaaacac 
55 cagtttataa 
tgcttcagaa 
tgtgtggaat 

60 

tacatctccc 



gatgggaggg gctacagtga 
tggctattaa tatttggttc 
tccccagaaa atgagttttt 
tatgctttgc ttccctttta 
tatgtgaaca cacctctaga 
tttcttccac cagataccct 
ggcagggcaa aatgccacca 
gttcccaaaa agttcctcat 
atcactatca gcatttcagt 
ccacatcttc ctgtcttctt 
gttccaaagt tgcttccaca 
caatttcctg tgttagtcca 
ttataaagta aagagggtta 
atgtacaatc atggcggaag 
gagagagagt aaagggggaa 
actcactatc atgagaacag 
agatcccccc tcaacacatg 
gatgaaccat atcaccatct 
aaaaacacct ggggctacca 
ttagaaagag catgatcctg 
aaacaataaa cttctgttgt 
aggaaatgaa tacatgagga 
gttatgaaaa tacccaaaaa 
cctgattttc taaatgttga 
tttattcaga tggttggtat 
atgaccattg taatcataat 
actgaagaaa gtgagtaact 
taagggccct ctctgagagt 
gtaaggtcaa gtgtatctga 
tgaagtcaaa caacttgccc 
aggagattaa cagaggaaat 



agatgtttga catgccctgg 
cttgttactt atgcaaattt 
cttttatacc atgtatcagg 
aacacaactt ccagtttcag 
aaaaaacagg tcacctctta 
aaatcatctc tctcaagttc 
gtctctctgc taaagcatag 
ctccatctga aaccacctca 
caaaagcatt caacgagtct 
ttaagccctt caaactcttc 
ttttcaggct atctttatac 
ttttcacact gctataaaga 
attggctcac agtttcacat 
gcaaaggaga aacaagtacc 
cttccacaga cttttcaacc 
catgggtgaa actgccccca 
gggattacag tttgaaatga 
gaacatagag gcagagatag 
gaacctggat gagacaaggg 
ccaatacatt tattttgaac 
gttaagccac ccagtttgtg 
caatatgtaa gtactcaata 
taatcataat tgtgtgagac 
tcttatataa ttttaaatgg 
gatgctttta caatatttac 
catccagtgg gttgtaagag 
ttgtttctgg cagtcaaatc 
tcaaagttta gacttggtga 
aaacatgctg aaatatcttc 
tggcaaaact atgtagaagt 
atatagacta aaagggtttc 
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agacattttc 37680 

ctgcagctgg 37740 

c tgcaaat 1 1 3 7 80 0 

ataatctctt 37860 

aacactttgc 3 7 920 

aaagttccac 37 98 0 

caagactgac 3 8 040 

gcctggactt 38100 

ctaggaagtt 3 816 0 

caacttctgc 38220 

cagtacccca 38280 

gctgcctgag 38340 

ggctggggaa 3 8400 

ttcttcaaaa 3 8460 

atcaagtctc 3 8 520 

tgatctaatc 38580 

gat ttaagtg 3 8 640 

gagttatact 38700 

aggattctct 3 876 0 

ttttggcatc 38820 

gtactttatt 38880 

aat tt aaaac 3 8 940 

tgctaggatg 39000 

ttaattaata 39060 

tactctgcac 39120 

catttcatca 3 918 0 

aagataaagg 39240 

acaggagtgt 3 9300 

tgcttgatat 39360 

cttttagact 3 9420 

agagatcata 3 9480 
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cttgtgatat ggggatactg actgctataa acccaaaaat tcaaatttag ttctgaaatg 39540 

ttggtttcca acatttggaa tatcaaatgc cttcagaaaa taaacatttt agaggacact 39600 

5 

gaaaagagta cttctagaat agaacagtaa gaacctccaa aaacccatct ctcaataaaa 39660 

gcaatgagaa tactgaaaaa tttgtcaaaa tcaacttttt cagaactttg gaaattagtc 3972 0 

10 aaagacctat gaaaatccaa taagcattta ttcaagttaa atggccaaat catggtaaga 39780 

acagtgagct ctatggtatt gcagcgtgcc ctattcccat ccgtcttacc ctatctcctc 39840 

agtagccttt aaaagtaaca gccttgcaac catagtacct gtgggaaaaa acggcctaac 39900 

15 

aacaacagga aaggataaaa taggatttaa gtgccccaaa aaagccccat tctcagagaa 39960 

ttgttgttaa cttgtctggc agctccccag acatccccac tggcaagact ctttaatttg 40020 

20 atttcattca gaactcagtc actgtaaaca gcttttcttt ctgggatatt tgtcagaaca 40080 

atcaatagca tttaacaaca cagctgcctg aggcagagat aacagttgtg ccaaacaaaa 40140 

ggtcacattt taaaaaaaaa tctgagcagt aagatgtcca tagagtattt tgaaattatg 40200 

25 

acaatatctg gaaacataaa aggtcacatg catatgctgg gatgtgtaca tgccaagaaa 40260 

agacctgaaa aatgtcctaa cctcccacct ctggctgacc tgaagaaaat ccaattcaac 40320 

30 cattcaagtt aaatggccga atcatggtaa gaacagtgag ctttgtggta ttgtaacttg 40380 

tcctattccc atccctcaca acccatctcc tgcttttatg ataaaagcag gaagtgatag 40440 

ctaaggcaga gttgcaaact ctcccagagc tttaaaacct taccccaaca caaacacatg 40500 

35 

caaacacgca cacacacaat ctctcagcaa agtctaggaa acttactggt tcaaaagaat 40560 

taaggaaatc tttgtccaat cactagctga ccactaagtt aacaaagcag aataacaaag 40620 

40 acaatgataa aaccaacccc tagagggcta tgatttacaa tgttgccaca ctatattatt 40680 

ttaaatgttc aatttttaac aagtaaaaat ataaaatatg caaggaaaca gagaagtatg 4074 0 

gcttaaatac aggagaaaaa aacagtcaat aaaaattttc cccaaagaag tctaaatgtt 40800 

45 

gaggttacta gaaaaaaaag gtattttaac tcagcaagta taaataagta taaagaactt 40860 

ttaaaaacca tgtctaaaaa aactaaaagt atgaacgcaa tgtctcacca aaaagagaat 40920 

50 attaatagtg acataaaata ggaactattt ttccaaaaga accaaataga aattgtagag 40980 

cagaaaatta taataactaa aatgaaaatt ttactcaagg gtctcaacag aaaatttgag 41040 

ctggcagaag aaataatcca caaacttgac agtaggtcaa ttggaattac ctaatctatg 41100 

55 

aaacagaatg aagaaaaaag aacagagctt cagacacctt tgaaacatca gcaagcatac 41160 

caaaatgtgc ataatggtga acacataagg aaagaagaga gaaagaaata aaaaaatggc 41220 

60 acagtgagtt tgtgaagaaa tattggaaaa tgtatagctg taaagactat attttgaata 41280 
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agtatctgaa atcaaaaacc caactttccc 
ctaaagtgaa aataataaag actagagtag 
5 tagagaaaaa caataaaacc agaagttcct 

attaaccaca ctgaccaaga aaaaaagaaa 
agaacattac tgacaatgtt atagaagtaa 

10 

agataaccta cataaaatgg acaaatttct 
agaagaaata gaaatctaag tacacctatc 
15 aactttccat aaagaatagc ccagtctcag 
taaagcatta atactagtta ttcacaaact 
cttcctaact cattctataa ggctaacact 

20 



30 



ataataaaag 


gaaactatag 


actaatatcc 


aactattaac 


aaatggaatc 


tagcaacata 


gatttatcct 


aagagtacag 


agctggttca 


ttactaatag 


aataaaagcc 


agaaaccaca 


ttaagaaaat 


tgaacaccct 


tttatgatta 


attttctcaa 


cctaaatatg 


ggcgtttatt 


aagagacact 


gaatcatttt 


cttcttaaga 


caacattgta 


ttgtaggttc 


tatccaggaa 


ttaatcaagg 


aaaaattaaa 


actaacactc 



aaatgctaaa aaaatccata aaaatctagt 

40 

agaatacatg gccaatatac agaaaccaac 
gtaacatcaa aaggaataaa tatacatata 
45 aagtgcgaga cttgtctacc acaaactcta 

caaacaaatg aaaagacatc ccatgttcat 
atgatggtaa tactccccaa ccaatttaac 

50 

ttcagaaatt gacaagctta tcctaacatt 
caaaccaata ttaaaaaaga aaaacaaagt 
55 gggttaaaga aagacacaca cacacacaca 
taaaaagggc acctcagcag caaatgcagg 
ggggaaccag tgtaatgcca gagcccaccg 

60 

tgggtgggag gggtctaggc agtacggctt 
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ccttaaaaaa 


tcaggaaaag 


aaaactaaac 


41340 


aactaaatga 


aatagaagat 


aggaaaacaa 


41400 


tctttaaaaa 


gaatcccaaa 


ataacaaacc 


41460 


gaagacttaa 


tttactataa 


ttataaagaa 


41520 


aaatattgaa 


caatcatatg 


gcaacaaatt 


41580 


agaaatacag 


acatgactga 


aactgactca 


41640 


acaagttaaa 


gaatcaaatt 


aataatcaaa 


41700 


atagctacat 


tgtgaactct 


accaaaggtt 


41760 


ttttcccaaa 


aaaaagagga 


ggaataaata 


41820 


agtttggtac 


taaaataaaa 


caaagacatt 


41880 


attatgaata 


tagatgcata 


aatcctcaaa 


41940 


caaaaagtat 


tatatatcat 


gaccagttag 


42000 


atatatgaaa 


actaatcaat 


gtattccacc 


42060 


tgatcacctc 


aaaagacata 


gaagaagcat 


42120 


aaaaagttga 


aaaactagga 


atagaatgga 


42180 


aaaaatttat 


agttaacata 


atacttgata 


42240 


taagaaacaa 


gatatcttgt 


cacttacatt 


42300 


aatcaggcaa 


taaaaaataa 


agaatttagt 


42360 


tttacaaatg 


acattatctg 


atctcgtata 


42420 


cgaactaata 


aataagttta 


acaaaatggc 


42480 


tgtatttttg 


tatacaaaaa 


cacagttaca 


42540 


cttcacattc 


aggaataaat 


ttttaaaaag 


42600 


aaacattatt 


gaaagaaatt 


taagaagata 


42660 


tgataaaaga 


caatatttga 


agacaatatt 


42720 


acaacccctg 


ttaaattaca 


gctggcattt 


42780 


catatgaaca 


tggaaaagac 


ccaaaatagt 


42840 


aggaggtgtt 


aggttttgaa 


gggaaggtaa 


42900 


cacacacaca 


cacacacaca 


cacacacata 


42960 


c tt tacatcc 


agcataaaac 


ctacagaagt 


43020 


ctgcttatag 


cctgggggta 


cttataggta 


43080 


gctacccagc 


aggatattga 


taagatgttc 


43140 
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ccatgatgag gtggttctgg cccttggttt ggcagaatgt cattatggtg ttccttggac 43200 

ctttgcccag caagatataa tagggatgtt tctttagttg ggcctttgtc tgccttgtgg 43260 

5 

tcatgtggtt aggcagaatg tttctcacgg cccgaactcc tgtgaaatgt ttcactttga 43320 

ccaaggtctg caaaatagca gagaatttac aaaatggtgc agtttggact aacaggaagg 43380 

10 ctcgcacttc ctagtttcaa aagttactaa aaaattacag taatcaagaa attgtggtgt 43440 

agcaggatga gccgtggaca aaactccaca gacactgaga aagtgaagga agtagcttta 43500 

atcagctgga agcatcggca gactagcgtc ttaaaatcca agcttgttga gtgcacaatt 43 560 

15 

tctgtccctt tttttttttt agcgctcaca atactaaagg tttcgcatga aagggttgtg 43620 

attgattgag cagtctaggg tggtaccgga caggggctgc agctacatgc accagtaatc 43680 

20 agagtgaaac agaacagaat ggaaaatttg ttgtatgtcc ttccacacaa tgtctggaat 43 74 0 

ctatggataa catcagttgc taagtcatgg gttgaatttt aaccatcagg ctaaggtcag 43 800 

gcaggcccag gcctggtttt gggtctggtt ttgggtctgg tgcctggcgc caggctgcct 43 860 

25 

gcctttggtt tcgcttcctt gtttcttctt aaaacaggta ctgagtataa aacagtatag 43920 

aacaatatgg gggggtctct ttctttcttc tctcagtgat actgccataa tgatagacat 43 980 

30 aaaaatcagt agaataggat caaaaaccca aaaataagtt catacgtagc tacttgattt 44 04 0 

ttgactagca tgacaagaca attcaatgga gaaatagtgg tctcttcaac aaatggtgcc 44100 

aggacaactg gatatctgca tacaaaagaa tgaagttaga ccactacctc acagaatata 44160 

35 

cacaaattaa ctaaaaatgc atagacctag atttaagagc taacattaca aaactcttag 44220 

gaagaaaata caggcataat tatttctggc ctgaattagt taatggtata ttagctatca 44280 

40 caccaaaagc atatttaaca aaagaaaaat catagataaa ttgaccaagg gaagaaaact 44 34 0 

acagaccaac atgtctgatg actacggata caaaaatctt caaaaaaaaa actagcaaac 44400 

caaatctagc aacatataaa tatgattata caacagggtc aagtgagatt tattacagga 44460 

45 

attcaaagtt agtttaacat accaaaatca atcaatgcaa tatatttcat taaaataata 44520 

aagggcaaaa acatgctcac taaacaaagt gatgaaagga aatagattgt aatggctgat 44 58 0 

50 gattttatct ccttacttat ctatcttccc taaatccttt caaaatagat aattaattcc 44640 

aaaaacaaaa atctgtagtt tggtgtcaac ctaaaactct gtcaggagcc aaaggaatgg 44700 

cttgtttgtg gaattcctgt aggataatat gtgggggatc atggcaggaa atgaggctgt 44760 

55 

agaagtacct tgagaccaga tcttaatggg gcttaactat accttattaa agaattttga 44820 

atttagtcta tagataccag ggtgctcttg cagggttttt aacaacagtg tttcttgatc 44880 

60 aaacttttca tttatataat tattttcatt agaatatttc ctgcagcaag ttacagccct 44940 
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tcccccctac cctcaactca aactgctaaa 
cggcttaggc agcataattt ccggttcaca 
5 gctacaagga gacctctctt tacttcactt 

gagttgagta atagcaaagg gaataggatt 
gcaggataat tttgagggag tcaaccagtg 

10 

tagagatagg aaaaagaaga acaaagctag 
aggcaagcca aacaatgagt acctaaacct 
15 atttgaaaga tatttaggag gcttattaat 
aaatcttaga ttaaataaaa ttagtttaat 
tactgctcaa taactattga acatagcttt 

20 

aaactatttt ttaaatcatg tattttcagt 
ttctgctgtg aacttaaaaa caaaatagaa 
25 actgttccca ttagaatggc ttctacagtg 
tggctctccg ctccaatggt tgttctcttc 
aggaatcagg actcaccaga gtccagttaa 

30 

tatgtttgtt gtagttcatt gaaacctctg 
tttggctagg ccccccagaa ctgagtaggt 
35 acattccagg tttacatatt tgttcatttt 

tgtctcagga taaggctttt attttagact 
atctcaggca ctttgccctc ggaaatctca 

40 

acaacttcta tttcttcata atttgcaaag 
attaattgtg aattagcgaa gaactctaat 
45 ctaatgaaat cctttccatt gctctttttt 

tcacaaattg actcttccct ttttaaaagc 
cctgcaatcc ctgtggttga agggatgtag 

50 

aggttggagg agattggctt gaattaagga 
gattttggaa gttggcataa tgttcatgtg 
55 ttttgttttc tcatctgtaa aacagaaata 
gattaaatga caatatatca ttaaaagagc 
tcaataagtt tttattattt attatttatc 

60 

atatatgaag ccagattgaa tttttcttgt 
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tcaatcagga 


atcatctcat 


gatcataaga 


45000 


tccagcagga 


gaaactggcc 


tttctcagaa 


45060 


ggcttaattg 


cctcagatct 


gctaatctct 


45120 


accatgaagg 


attcagacaa 


atcatttgta 


45180 


tcctcagtca 


ttctgggagg 


aaaatagaga 


45240 


agatgaatat 


aattagctgg 


ttgccccaac 


45300 


aaacaatgaa 


gaataaagag 


aagatgataa 


45360 


gtcttgtatg 


ttcgtttcac 


actcaatgtt 


45420 


aaactgctta 


tcccttattt 


ctatttccca 


45480 


aaatactaac 


ctgtatttta 


cttgattagg 


45540 


tttgtgtgta 


tgcgcttata 


cgatataata 


45600 


aaacagaggt 


ttactcaaga 


aaagaatgac 


45660 


gctcccaaaa 


atcctgtagt 


atgagccaga 


45720 


tgatctagaa 


tatgaagtag 


agtacccaag 


45780 


aatcatttta 


agaagaataa 


tttagtttaa 


45840 


gtatctgatg 


agttataggt 


ctgatatgcc 


45900 


actgtcttct 


cagtacactt 


ttaactaacc 


45960 


ttgttgttgt 


tttgtcttac 


tttaactcaa 


46020 


gtaataatga 


tcaatttgct 


atctcaattc 


46080 


ctgtcggaat 


ccttggaaat 


cacaccacgc 


46140 


aaccagtgtg 


actccagtca 


agaacaaaag 


46200 


tttttcaagc 


agccacatgg 


aaaacgcatt 


46260 


cttagttctt 


caataaagca 


tgcgatttga 


46320 


cagtatgggt 


taacacggat 


aaatatagcc 


46380 


gagagtagtg 


ttgaatgccc 


actttatctt 


46440 


actccttcat 


ccaagaggag 


aaaatactct 


46500 


ataatgtcaa 


tactcttgct 


ctctctgtac 


46560 


atagtctcta 


acccacagag 


tagatgggag 


46620 


t tacaacagc 


tcctaactca 


tagttgtccc 


46680 


agctcttatc 


attgtatcag 


tagcataatg 


46740 


tttccaagaa 


ttggtgacta 


gaacctccct 


46800 
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ctgaaagcag atgtactgaa atgaggccac aaaactactt aaaatcttga tgaaaatgtt 46860 

atgtattgag ttgtgtcccc ctcaaaaaaa tatattgagg tcctaatccc cagtactcag 46920 

5 

aatatgacct aatttagaag tagggtcttt aaataaccaa tcgaattaaa atgaagtcat 46980 

taggagagag tctaatccaa tatcactggt ttccttataa aaagagaaaa ttgggccagg 47040 

10 cgcagtggct cacgcctgta atcccaacac tttgggaggc cgaggcaggt ggattgcttg 47100 

aggtcaggag ttcaagacta gcctgaccaa catggtgaaa tcctgtctct actaaaaata 47160 

caaaaattag ccgggcatag tggtgcgcac ctgtaatccc acctactcgg gaggctgagg 47220 

15 

caggagaatc acttgaaccc aggaggcaga gattgtgatg agactagatc gcaccattgc 47280 

actccagcct gggcgacaga gtgagactct catctcaaaa caaagaaaag gggaaattag 47340 

20 acacagacac agatatgcac agagggaaaa cgatgtgaag acacacaaag agaaaaccat 47400 

gtaaagatga aggattgggt gatgcatcga caagccaaga agtgccaaag atttctagca 47460 

aatcaggagg acgtcaggaa ggatccttct cctccaggtt tctgagggag catgaccctg 47520 

25 

agagcacctt gattttggac ttccaacctc cagaacagtg agataataaa tttctcattt 47580 

taacaccttg ttttaaccac tcgttttaac cactcgtttt aacaccttgt tacattagct 47640 

30 cggggaaact aatgcaggtg gcttttcact ataaggggca ctagatttcc cccaccccat 47700 

ggaacatgtt aggtgtttat gaaagcaggt tatcctctca cctcagcatt ccacagtagg 47760 

agtgcagcat tcgaacggct ctactctctg aagctcgggg agagaagaag cagctgtaaa 47820 

35 

gtgtgtggct gtttacagga agcatcccac tccccaacga tagtctttgg gacccaggtg 47880 

ggacttgctg acctgatcta gaaaagccag agcaaaagcc agcaatgtta aaaagaaggc 47940 

40 atggtatagg agcctgcaga gagggaccat ggaccactcc taaaagtcgg agcttgcaag 48000 

aggtaaaaaa gaaagatacc tatcaccact gcctgcaggc cccaaccctg gaacactaga 48060 

gggaaatgag actggaagcc tcaggttacc tgagaattgc tgtagcatgg ggagcaaggc 48120 

45 

atatgaaagc acttcagcaa aatgctgcca atggagggtc agtacgaagg tgtagccaaa 48180 

ttaccaaaga agcactaatg cctaggagag agcttctcag aattgagtgg ctcttaactg 4 8240 

50 tggccagtag aagcttccac atacctgcct aggttaggaa tataatccac agaaagagaa 48300 

ctatattctg acatgggagc tcatttaatt tcatttaatt tttttaaaat aaaggttttt 48360 

ttatatgtat aaaatttgtc attttatagg catattttaa atgtggatat tttaatttat 48420 

55 

atcataattg tataatttat atgcatattt taaatgtgca tattttgtaa cataattttt 48480 

aaagcctctg ctgtgaacat tacatattga atgtatatgt ttgggttgtc tgttgttttt 48540 

60 cagtcttatt ctgatgttat ctccaggagt ccaccataca atttctccag agcctctgtt 48600 
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acatctatct 


tcttccaatc 


catccttgat 


acacttttaa 


agataacagc 


acccaagatc 


48660 


atgttaagta 


ctagagactg 


aacatatatc 


tctacagata 


gtgggatctc 


tcatagttca 


48720 


aactttgtgt 


gggatagttt 


tctaattccc 


cattccatgt 


aataacaaaa 


gaaattaata 


48780 


ctgtatttgc 


cctggttaat 


tatgagacta 


tttctcactt 


tttctacttc 


agcactctgt 


48840 


aattggcctc 


agcctgttct 


gcctacacag 


attaacatta 


tatcctcctt 


cctcctatat 


48900 


tagatgaata 


agtctgttct 


tttattatct 


cattctattc 


tttattattt 


ataatagtaa 


48960 


ttcccattaa 


tacctctgct 


ctttcttttt 


tatatatttt 


ttgtacatat 


gacttctgat 


49020 


ctttttctct 


tacatacaaa 


gtattattat 


aaaaatgtac 


aggcatatct 


aattattatg 


49080 


tatttcagta 


tattcaggtc 


tgacatttta 


tataataaaa 


tggatatttt 


ataataaatt 


49140 


tacttgccct 


ttatattaca 


gttaagcatt 


atattgatct 


cttaaattat 


gtagatctca 


49200 


ttatttgtag 


tttgtatctc 


aggatatgaa 


aaatattaac 


tctacattta 


ttttactgag 


49260 


aaatttttaa 


accatctaga 


ttttgactaa 


aagtcttagt 


acagactttt 


aattccatcc 


49320 


aatgtgaaat 


aactgatatc 


agatatactg 


aaactagcta 


caaaaactgg 


ggaaaatgca 


49380 


taaaacatca 


atttttaggc 


actgtacaaa 


aaccaacaga 


aggtttccat 


tctttggaaa 


49440 


tgggaaacat 


acaaggtgag 


cccccacatt 


cacccagact 


tccccttggg 


aacattttcc 


49500 


aggccatgtg 


gtaggaacgt 


agaacccaaa 


tagttacagt 


ggtcttgctg 


gatggaagca 


49560 


gagattaagt 


ttgaaactgc 


taaagtagct 


ggaattcgat 


gggcatacct 


acagagatag 


49620 


ggaagctact 


gaacagatac 


ctcagaagtc 


tacaaagaaa 


ttttccttag 


gtccttagct 


49680 


gaatcctggc 


catatccttt 


aatacaactt 


cagaaggcct 


aacagaaagt 


aaattcctgg 


49740 


ccgggtgtgg 


tggctcacgc 


ctgtaatcac 


agcactttgt 


aagggaggcc 


gaagagggca 


49800 


gatcacttga 


ggtcaggggt 


ttgagaccag 


cctggccaac 


atagtaaaac 


cccatctcta 


49860 


ctaaaaacac 


aaaaattact 


caggcatggt 


ggtgggcgcc 


tgtagtccca 


gctactcagg 


49920 


tggctgaggc 


agaagaatca 


cttgaaacag 


gagacggagg 


ttgcagtgag 


ccgagatcct 


49980 


gccactgcag 


accagcctgg 


gtgacagagc 


aagaccctgt 


cttaaaaaaa 


aaaaaaaaaa 


50040 


gaaaaaaaaa 


aggaagtaaa 


ttccagggag 


actgagagct 


gagcatggat 


tcttgatcct 


50100 


acacagtgtg 


gtgagtggtg 


aagtgcaggc 


gcagccaaaa 


gagagaaacc 


ttcttgatca 


50160 


tccctgacat 


tcaaaggata 


ctccagagat 


gctacagctt 


ctaattaaga 


acgatgaact 


50220 


aggagtacag 


gctacacctt 


ggaataagaa 


aaagggagaa 


aaagacgcac 


cctaacaaag 


50280 


cctaaaatca 


ggatgcaaga 


tgatctgcaa 


aaatgtaatt 


gctggtgtgc 


ctcctccaag 


50340 


gagaggaaag 


agaatagtac 


ccagatcctc 


acattttgaa 


cagatcatct 


aggaaagaac 


50400 


acttggatac 


agcagagaac 


agacgagaaa 


caccagcagt 


aagaaaggac 


atggtgtgag 


50460 
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gaagcttgcc cagccaggaa ctgaccaaga gccaagagag gctcctggac ctagggaaac 5052 0 

aggaagagag aaacccctag aactccaaaa tgggctttta tgatcttggc tatgggagaa 50580 

5 

accctgtagt ggattcacga cagcatcagg cctggcatat ggagcttcct aaagattgca 50640 

cagagatgtt gccccagaaa gggaacccac acagaatccc acaagcttca gcacccagag 50700 

10 cagcctcagc tgggagccat tttgagagcc tggataccgg gtatctacag acacagctgc 50760 

tgccactgag cagctccaag gagggagagg gaagaacagg tgctcccatg caccaatgag 50820 

agggtacctg ccgccctgct atgggctgca gttgagactg agacataaat gagccacgtt 50 88 0 

15 

ccccacagct tcttgctcac actgcttgcc tgggagatgc cccaccgtct ctggtctaaa 50940 

ccccaaggca ccatcttaag agttgagtgc tggactgtgc cctgcccttg ggctgaggtc 51000 

20 aaactaacac aattacagct gctgcctagc caaggagtga cggggaaacc aggctatctc 51060 

atgcaaatat aggacaatac ccactgatct gcaacaggct gctgtgagac tgagacgtga 51120 

ggggatgaca ctccccacag cttcttacat atgcttcttg cctaggaggg accccaccct 51180 

25 

ccctggccac aggcctaagg tgccattttg agagtttaat gctgggctat gccccacggt 51240 

caggccaaat tcaagttgac atggctgcaa ctactgccca gtcaaggagg gatagggcag 51300 

30 ccaggctttc cgacgcatac ctaggacaat attcaatgcc ctgccatagg ctgctgcaag 51360 

accaacactc aagcgaacca cacttctcac agagtcttgc ccatactgtt cgcctgagaa 51420 

gggcctgacc ctctctggtc acaagcccac agctggcact atttttagag tttaacacta 51480 

35 

agctgtgccc cacccttggg ccaaggtcaa ggtgacacaa cagcagtcgc cacccaactg 51540 

agagagagac aaggaagacc aaggtctcca aagcacactt agaacaatac ccactgccct 5160 0 

40 gctaaaggca gctgtgggac cagggagtaa cccaccccaa tccattgcag cttccagcaa 51660 

caccaacaca gatggcatgg gtcactgtgg tttcctccac cactgctact gccacaccac 51720 

gccagcttcc caggggcctg agactccacc cacacaccag gcccactgct cccactgccc 51780 

45 

acttctaagc aagcagcttg gaggcccaag aatgagccct ctaggaccgc taacacaagg 5184 0 

gccagtataa gccactctgg agccttaaaa caggctcgct caccccaatg ctgccactac 51900 

50 tcgggcccaa agactggctc agttggtgta ttagtttgtt ctcacactgc tatagagaaa 51960 

tacctgatgg ccaggcacag tatttgggag ggtgaggcag acagatcact tgaggccagg 52 02 0 

agtttgagac cagcctggac aacatggtga aaccctgtct ctactaaaat acaaaaatta 52080 

55 

gatgggtgtg gtggtgcaca cctgtaattc cagctacttg ggaggctgag gcatgagaat 52140 

cacttgaacc tggaaggcgg aggctgcagt gagctgagat cgcaacactg cactccagcc 52200 

60 tgggtgacag agtgagaccc tgtctaaaaa aaaaaaagaa gaaagaaaga gagagaggga 52260 
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gagagagaaa gaaagagaga aagagagaaa 
gaaagaaaga aagagaaagg aaagaaagaa 
5 aaagaaagaa ggaaagaaaa agaaagaaag 
agaaagaaag aaagaaagaa agaaagaaag 
aggaaagacc tgagactggg taatttaaaa 

10 

cggactgtac aggaagcatg gcagcatatg 
catgacagaa ggcaacatgg gaacaggcat 
15 ggggtgcccc acacttttaa acaaccagat 
agtaccaagg ggaaaatctg ctcctgtgat 
acactgggga ttgcaattca acatgagatt 

20 

gttggcatct aagtccccag ctaaaagtca 
aggcaatccc agagaccact gactttgtgt 
25 tactgccagc atgcaaaatc aaagtcgaag 

agaaaaaaaa aaaaaattcc cctaccaaag 
tacactagat gcaaagacat caatggaaaa 

30 

ggatatcatc aaaggaccat aattgtccag 
aatgccagat aaataattta aaatattgat 
35 tatctgaaaa tcaggccagg catagtgtct 

tttgggaggc catggcaggt agatcacatg 
catactgaaa ccctgtcttt ctactaaaaa 

40 

aatcagtaaa tcaattcaga aaatgaaaga 
ttttgaagta attcagccag acaaaaataa 
45 cctttaagat atctgagatg atataaaaca 
99999 ta 9 a 9 agattcaaaa gtttagaaat 
ccagcaattt gggaggccaa agtgggagga 

50 

taagcaagaa aacaagaatc tgcctttaca 
tggctcatgt gtgtcatccc agctacttga 
55 aggagtttga ggttgcaatg ggccatgatt 

gtgaaacaaa acaaaacaaa agcttagaaa 
tcccaagaat atcaagagag ttagacataa 

60 

aacacattgc aaaaaggatc tcaccatggc 
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ggaaggaagg 


aaggaagaga 


aagaaaaaaa 


52320 


agaaagaaag 


aaagaaagaa 


agaaagaaag 


52380 


aaagaaagaa 


agaaagaaag 


aaagaaagaa 


52440 


aaagaaaaag 


aaagagaaag 


aaagaaagaa 


52500 


gaaaataggt 


ttaattgcct 


ggtggttcca 


52560 


cttctgcaga 


ggcctcaggg 


agcttttcct 


52620 


cttacatggt 


aggagcagaa 


ccgaggcggg 


52680 


atcacacaaa 


ctcactcact 


attgtgacac 


52740 


tcaatcgcct 


gtcaccaggc 


cccacctcca 


52800 


tgggcaggga 


cacagaccaa 


aaccacatca 


52860 


ccacaacctt 


aactgtacca 


taagtcacca 


52920 


actgcgaaag 


aagtcaaaca 


aagatcacac 


52980 


tatcttaatc 


aacaacacgt 


acacaacctc 


53040 


caatttcaaa 


aaattggaac 


aagcaactgc 


53100 


acacaggaag 


catgaaaaag 


cagggaaatg 


53160 


caacagattc 


caacaaaatg 


aatcccttga 


53220 


ttttaaagaa 


gcttaatgag 


atgcaaagga 


53280 


cgtgcctgta 


atcccactgt 


aatcccacac 


53340 


aggccaggag 


ttcaagacca 


gcctgggcac 


53400 


tacaaaaaaa 


aatgtaagct 


aatacaagaa 


53460 


aaataaaatc 


tcagtacctg 


aagacatgtc 


53520 


ggaaaaaaga 


attacaaagg 


ataaacaaag 


53580 


accaaactta 


caaattatca 


gtacacccaa 


53640 


ggccaggtgt 


agcggctcat 


gcctgtactc 


53700 


ctgcttgaga 


tcaagagttc 


aagactagcc 


53760 


aatttttttt 


tttaaattag 


ccaggcatgg 


53820 


agggctaagg 


tgggaggatt 


gcttgagccc 


53880 


gtgccactgc 


actccaacct 


gggtgacaga 


53940 


acatatttga 


ggaaataatc 


aataaaaatg 


54000 


agatacagga 


agtccagtga 


tacccaggca 


54060 


aatattatat 


tcagaatgtc 


taaagtcaat 


54120 
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gagaaagaaa gaattttgtc agtaacaata caaaagtttc tagtcaccta taaagaaaac 54180 

tccattagac taacagcaga cttttcagca gaaatcttac aggccagaag agaagggaat 54240 

5 

ggtattttca aagtgcttga acaaaagaac tatcagccaa gaattttgtg tcctgccaaa 54300 

ataggtttca taaataaagt cctttccagg caagcaaaca ctgagggaat ttgtcacccc 54360 

10 tacaaatggt caccctacag gaaatgctca aagaggtctt aaacatgtga atgaaaagtc 54420 

aatatccatc atcattaaaa cccatggaaa tataaaactt acagtttgta taaaacaatc 54480 

acacaaagga ggaaaagaaa ggaatcaaat ggcaacatga caaaatttta ctaaaccaca 54540 

15 

aagacaaaaa gacagaaaca aagaatctat aacttgaaat cgatcaacaa tatgacagga 54 60 0 

acaaagcttc acatatcaat attaaccccg aaagtaaatg gactaaatgc tccactgaaa 54660 

20 agatgcagat tggcagaatg ggtttaaaaa ataatccaag tatatgctgc ctacaagaaa 54 72 0 

ctcaccttac ccacaaaaac acatataggt tgaaagtaaa gaggtggaaa agatattcca 54 780 

agcaaacaga aataaaagca agcaggagta gttatcctta tgtcagataa aatagacttt 54840 

25 

aaatcaaaaa cagtaagaaa gaacacagaa gggcattaca tagtgataaa gtgatcaaga 54 900 

agatgtaaca atcctaaata tatatgcacc caacattgga gcacccaaat ttacaaaaca 54 960 

30 aatattactg cagaaaacca aataccacat gttctcactt ataagtggga gctaaatgat 55020 

gaaaactcat caacacaaag ggaacaatag acactgagat ctatttgagg gtgaagggta 55080 

ggacgaggaa gaggagcagt aacataacta ttgggtactg ggcttaatac ctgggtgatg 5514 0 

35 

aaataatctg tacaatgaac ccccatgaca taagtttacc tatgtaataa accttcatgt 55200 

gtacccccaa acctaaaaaa acacacacaa atattactac acctaaagaa agagaaagac 55260 

40 gagaaaggca gcaacacaat aatagtgggg gacttaacca ccccacttgc agcactagac 55320 

agatcatagg gagaaaaaat taacaaagaa acattggcct taaattggac tttagaccaa 55380 

atggatttaa caggtatttg tgcagtggtg caatcttggc tcactgcaac ctccacttcc 55440 

45 

tgggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgcacc 55500 

accatgcccg gctacttttt tgtgttttta gtagagacgg ggtttcagca tgttggtcag 55560 

50 gctggtcttg aactactgac ctcagatgat ccaccctcct cggcctccca aaatgctggg 5562 0 

tttacaggca tgagtcactg cgcctggccc agaatatgca ttatttttat cagcacatgg 5568 0 

accattcccc aagatagact acatgttagg ccacaaaact agccttaaga aatttttaaa 55740 

55 

aattgaaatc atgtcaacta tcttctcaga tgacaatgca ataaaggtaa aaatcagtac 55800 

caagagaaat ttcagaaagt atacaaatac atggaaacta aacagcatgc tcctgaacaa 55 860 

60 tcactgggtc aatgaagaaa ttaagacaga aatttaaaat ttttatgaaa tgaaaagaaa 55 92 0 
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aacacatctt accaaaacct gtgggataca 
gcattaaatg cctacatcaa aaaaagcaga 
5 tctcaaggat gtagagaaac aaggagaaat 

aagatcaagg caggactaaa taaatcagaa 
tgaaaagttt gttccttgaa aagataaaat 

10 

gagagaagat tcaaataaac acaatcagaa 
cagcaataca aaaggctctg acataaggat 
15 ctcactgtga aaaaaaatgg aaaagtgata 
gtattaatga atgagttctg ttatagggaa 
agagacagtt gaaacacact tcaaaattat 

20 

ttatccagca tatgcacgtt acactggata 
cacctatggc ctgccccata cctatgcatc 
25 tggagactgg ctgaagatgc agcatcaata 
tgaggctgca gtgagctatg atcacgttac 
aagacctcca gagtggatca aggggatata 

30 

ggtttggaca ataataatct cccttgtatg 
cttctataaa ctcaattaaa aggtgacaaa 
35 ggtcctgaat gtcccacgtg cactagggct 

tcttattatt tctgtcaata cagtttggtt 
agttacttta tagagttaga caaatgctaa 

40 

tagccataaa atataatgaa aaaggaaaaa 
gaagtacact ggaaagcctc tataattaaa 
45 atagaccaat agaaagtaaa agaaagacca 

ggctgtgaca aaggtggcat ttcaaatgcc 
tgccctgggg cagccaggta gacttttgga 

50 

cattcacaaa aaaatacatt ttacatgtat 
acaagtatta gaagaaaaca cagctaaatt 
55 tcaaacaatg actcaaatgc ataggtaata 
aatttttaaa atttttcttg acagaaaacc 
aattggagaa tataccaaag acaaagacct 

60 

aaattaaggg acaaaggtcc aggaatctga 
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gtgaaagtag 


ttctatgagg 


gaaatttata 


55980 


aagatgacaa 


attaacagct 


taatattgta 


56040 


caaacccgaa 


attggcagaa 


gaaaaaaaaa 


56100 


atgaaaaaag 


aataaaaagg 


atcaacgaaa 


56160 


tgataaacca 


ctagctagac 


taataaaagg 


56220 


ttgaaaatga 


agacattaca 


attgatacca 


56280 


ttgggtgctt 


acttgggaag 


tgacccaaga 


56340 


ccaaaaaaag 


gaagaaagtc 


aatagactat 


56400 


ctggtgctta 


atcctgttgg 


agactctcta 


56460 


ttctcttagg 


ggacagagaa 


gctagggtag 


56520 


ggaaaaattc 


ttgtgggatt 


aacttcccag 


56580 


ctcggaaaag 


aaaggagttg 


tccatagatg 


56640 


cagcatttgc 


tgcagagttt 


gctcagagtt 


56700 


tgtactctag 


cttgagcaag 


accctgtctc 


56760 


gatgctgcat 


caacagcatt 


tgctgaagta 


56820 


tctccggtcc 


cttctagaaa 


gttggcacca 


56880 


tttcatcttc 


tacaatatct 


ggttaaacct 


56940 


tagccaccta 


tatgtcctca 


ttcagatgcc 


57000 


catgttattc 


gtctttacaa 


aaagctattg 


57060 


agttcatttg 


gaaaaacaaa 


catgcaagaa 


57120 


ttacacagga 


gactaaccct 


acaggacatt 


57180 


acattgttga 


aataacacat 


gaatagacaa 


57240 


gaattagaac 


caaatgcata 


tagaaattta 


57300 


gctgaagaga 


agatgagcta 


tttaataaat 


57360 


aaaatttaaa 


attagaccca 


catctcatag 


57420 


taagaatcaa 


aatgtaaaaa 


gtaaaacaat 


57480 


cctgtttaat 


cttagtgtta 


ggaaaggctt 


57540 


aaagataaga 


ctgataaatt 


ttactacaaa 


57600 


accctaaaca 


aaatcaaaag 


acactgtcaa 


57660 


aaaatcccta 


gtatgtaaag 


aactctttaa 


57720 


tagaaaaaat 


gggtaaagaa 


gtaaaatttg 


57780 
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ggttaggcat ggtggctcac tcctgtaatc 
tcatttgagg ccaggaattg aagaccagcc 

5 

aaaaatttta agatttagct ggccacagta 
ggctgaggca ggtggattgc ttgagcctgg 
10 actccagcct gagcaagacc' ctgtctctaa 

ctgactccag ctaccttttc atctttttta 
caacatatct ctaccttctc ttccccaaga 

15 

ctttttgttt tgaccatctt acaatgttct 
ttgttcacct tcttcaacgg taaatctgtc 
20 tgtgcaattc acacgatgta tgtcatagat 
gattttctat tggctctgac ttggatgaaa 
cactcgctgt gattctatcc ctttctggac 

25 

tgtttgaagt cttccctaat ctcttcaagt 
gtggggacat ttatctctct tctgtggtgt 
30 atacagcaca ccctgggtgc ccagttcatt 
ctttcttctc catggagatt ataccataga 
ctcattaacc aaaacaggaa ataaatacga 

35 

tggcctgcat ccaggaaata gaggacttcg 
ccagtacatt cattctctca ggaaaaaaaa 
40 gatcaagaga tacgtggctg ctggcagagc 

aactgggagt tggtgataag gtttttctac 
tgagggatag acagagtgta ttgtactagc 

45 

agaaagatag gacatggagg gggatgggag 
cattagaatg tataagtagc ctagaaaatt 
50 ctgccttggc aatggggctt cccttgagaa 

acctgggacc ttggaggatt cttgtccact 
agagaattga actcaatgtt gtccaagaga 

55 

agtactaaca tatgcaaagc agcactctaa 
catctgtaat cccagcactt tgggaggccg 
60 cgagaccagc ctggccaaca gggggaaacc 
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cctgcacttt 


gggaggccga 


ggcaggagaa 


57840 


tgggcagcat 


agtgaaatcc 


tgtctctaca 


57900 


gtgcgtgcct 


gcagtcccag 


ctattcagga 


57960 


gagtttgagg 


ctgcagtgaa 


ctatcactgc 


58020 


aaaatacaaa 


caacaacaaa 


aaagtgaagt 


58080 


cattcaatgt 


ccttcactgt 


gagccattcc 


58140 


cttttccctt 


tccataccct 


caacggtgtt 


58200 


agccatccca 


tcaacttcat 


ttgttcttca 


58260 


atggactatt 


acatgcagct 


gtttttgtac 


58320 


atgtggaatg 


ccctcccatg 


gatggcttct 


58380 


atagatctaa 


gttgattaca 


agtctcattt 


58440 


tgcagtttat 


tgatttcata 


tgccaccttc 


58500 


ccaacaaatc 


attttctcct 


ctgttctcct 


58560 


ttatttcatc 


ctgtctcatg 


ttacagctaa 


58620 


tctgtgtctc 


ccagagcaac 


cagcccaacc 


58680 


ctcttgtcaa 


atcaaaccat 


atttaactgg 


58740 


atgaaactga 


gctctaagca 


gcatgtaacc 


58800 


gatccttcta 


accctaccac 


ccaactggcc 


58860 


caaggtcccc 


acagcaaaga 


aaaggaatag 


58920 


aaggtgagtc 


tgcttaatct 


ggtggttggt 


58980 


caagtgagac 


ttacggcgaa 


ccagtgtgga 


59040 


tttattgagt 


ctgctccaag 


agaaagaaag 


59100 


aaacaaaaag 


aaaaagaatt 


tttaaaacca 


59160 


gagcagtgga 


agccacccaa 


tgaacagaca 


59220 


ggataatctc 


catttaacta 


atattgacat 


59280 


ctctcaccct 


gtttcctttt 


ccagaaaatg 


59340 


ttttctagtt 


ctgaactttt 


gctttcagcc 


59400 


acactatgag 


aggccgggtg 


cggtggttca 


59460 


aggcaagcgg 


atcgcctgag 


gtcaggagtt 


59520 


ccatctctac 


taaaaataca 


aaaattagct 


59580 
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gggtgtggtS gcgggggcct gtaatcccag 
ttgaacctgg gaggcggagg ttgcagtgag 
5 gtgacagagc cagactctgt ctcaaaaaaa 

aagatggaaa agatggaccc cttgccctct 
ttctcatctg taaaactggg cgggtgagag 

10 

cttaagtgcc ctataactga tatctattat 
ggttaagatg ctccaatgag ctagaagaag 
15 tgtgagggaa aattctctcg catttatttc 

ttccggtggc caattctgtg ttggtggtgg 
gaattatgtc tcacgtgccc ctgtatccaa 

20 

ggaacccacc tagtttggtg tcgaatgtga 
aaggcaaaac cttgggggta agtgagattt 
25 agtgatggca gggggaaggg aaggcctggg 
ggctgagttg atcgcgcctt gccaagcaca 
cacaggcggc ctctgctcag accttgtatt 

30 

gaaaagcaac aaaaaagcag cacgcttgaa 
gccctgtggc ctctgaagaa aggggtccca 
35 cagaatccaa taagatcgaa tcaaatattt 

ccaacaacat aaatggtcac cgcaacccct 
gcccatttca ctgtgtagct gaatgatggg 

40 

caaccctagc tccaccctca gatagtgctc 
gagggaaatt tgtatctgcc aaatttacaa 
45 atcctccaga tatgcctgtg cccacaaaaa 

cgtggtttgt gataacaaaa atattgtaat 
atattggtta agtcagtgta atatactggt 

50 

gtggagctat gaaaagaaca agaaagctct 
atggagttaa tggaagaaag taaggggcag 
55 gaatttaagg aagtaatata tttgtatttg 
agatgttctt tcatttacaa tgggattatg 
tcataagtaa aaaatacatt taatacatgt 

60 

ctaccttaaa caagttcaga acacttaggt 
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atactcagga 


ggctgaggca 


ggagaattgc 


59640 


ccaagatcgt 


accactgcac 


tccagcctgg 


59700 


aattaaaaaa 


taaacactat 


gagagacaca 


59760 


ggtagccttt 


agcctctttg 


agtcttagtt 


59820 


gatgaaagga 


aatgtatttg 


taaagcacct 


59880 


ttgtaacatg 


ggattcataa 


acttgtagag 


59940 


caaaagtcct 


ttttaaagga 


caagtccata 


60000 


ttggcagcat 


gaattcgatg 


acttcagcag 


60060 


caccccacaa 


tggttatcct 


gtgaccccag 


60120 


acagccagcc 


gcaagtccac 


ctagttcctg 


60180 


atgggcagcc 


tgtgcagaaa 


gctctgaaag 


60240 


ccctttgcag 


gccacagact 


gcacagctgg 


60300 


aaatacacaa 


acactacatc 


ccgcaagtgg 


60360 


ggtctattag 


aaagctcagc 


tgcacaacat 


60420 


ttctgattta 


ggcattttcg 


gaaagagaaa 


60480 


ggttgatcct 


gttgccagga 


tccacgccca 


60540 


gcaagaccaa 


aatagaccac 


aaggtggagt 


60600 


gcccaacagg 


aaacaaagca 


ggttaggcaa 


60660 


gagaggctgt 


cctgcgctgg 


acaaagaaag 


60720 


catgaatttg 


gatgcagact 


gcccgagctc 


60780 


atgggaacaa 


aaattgatac 


aacccttagg 


60840 


aatatccaat 


aattccattg 


ctggaacttc 


60900 


ctgtgatatg 


tatgagctta 


ttcattgcag 


60960 


atccatcaat 


ataataccgg 


ttaagtcagt 


61020 


taagtcagtg 


catccaaaga 


ataaaatgta 


61080 


ttgtgtaatg 


atagggacac 


accccaagat 


61140 


acactgtgta 


taatttgctt 


acatttatgt 


61200 


ctaccacgtg 


tgtaatatac 


aaacatacac 


61260 


tcccagtgta 


cccatcgtaa 


attaaaaata 


61320 


agcctaccaa 


acatcatagc 


ttagcctcgc 


61380 


tagcttacag 


ctgggcaaaa 


tcatctaaca 


61440 
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caaagcctat 


ttaatagtgt 


tcttgaatat 


5 


aatgaaaagc 


agaatgattg 


tatgggtact 


ctttcacacc 


ataataaagt 


tgaaaacttt 




cccaggctgg 


agcgcagtgg 


cgcagtcttg 


10 


anrrrah t~ r* t" r* 
^ y ct u. L- l.^ 


L^MV^U L- CT. V — . 


rfprraaata 




gagctagttt 


ttgtattttt 


gtagagacag 


15 


aactcctgac 


ctgtgatccg 


cccgtctcag 




gccactgtgc 


ctggcccaca 


tggcattttt 




acccactcta 


ctccagtgtg 


acctcatctt 


20 


Ldcia ucty dy u 


L-aLct l uyci 






gggttggggg 


aggggagcat 


aatttaacac 


25 


aggcaaagat 


ttacatagaa 


ggatgtttat 




aaataatctt 


acctgtccaa 


taactctgtc 




atctgagcag 


ccctgaaaaa 


acatatgttc 


30 


Lei L^ct Ly adci 


La l L-Ctciy i— ua. 


Cl C< CI O. CI I— L. \^ C3. 




tttgaaaaga 


ataatgggag 


aatacaccaa 


35 


gggagattac 


aggtagtttt 


tatttcattt 




gaaaataaaa 


catttattaa 


aattaaaagg 




agaaacaaga 


cttcagcttg 


tggctctcct 


40 




\*\* L-^yy^ L- o ^ 


at*ra t - nctroa 




cattctacgg 


aggctttccc 


ttctggggag 


45 


ccgatgagct 


cccagaaggt 


gccaaggcac 




tgccctgcag 


aaggttccca 


gccacccaaa 




cctcgttccg 


caagcatatt 


agtttggtag 


*so 


ccggacggct: 


UaaaCaaLay 


addLLLy L i_ ct 




atggagatgt 


tagcagatgt 


tttcccctaa 


55 


cttctccctg 


agcctttcca 


cggccttcct 




tgaggacacc 


agtcattttg 


gattagggtc 




cggttttaac 


aaccctatct 


ccaaacacag 


60 


tcaatctacg 


aaatttggtg 


ggggatagag 



ctaatgtagt 


ttactgaaca 


ctggactgaa 


61500 


ccaaatacgg 


tttctactga 


attcatgtca 


61560 


gtttttttga 


gacagagtct 


cactctgtca 


61620 


gctcactgca 


acctctgcct 


cctgggttta 


61680 


gctgggatta 


caggcaccca 


ccatcatgcc 


61740 


ggtttcacca 


tgttggccag 


gctggtcttg 


61800 


cttcccaaag 


tgctgagatt 


acaggtgtga 


61860 


tataaggaca 


ccaatcatat 


tggattggag 


61920 


ggctgattat 


atctgcaaca 


actctgtttt 


61980 


attagaattt 


caacacatga 


atttggagga 


62040 


ataacaatat 


cctaaagaaa 


taagagatga 


62100 


cacaatgtta 


catataatag 


aaaaaagtag 


62160 


tcaactactc 


cacagaccat 


ccataatggg 


62220 


caagaatact 


taatgccatg 


aggacacatt 


62280 


gggggccatg 


ttgtgctatc 


aaggtatata 


62340 


aatgtaaaca 


gttgttttct 


ccaggcggag 


62400 


ttgtatttcc 


catattactt 


tttagaccat 


62460 


aggctcatag 


gggaggcagg 


acccagcctc 


62520 


gacaggccat 


ccagatcatc 


attggcctgg 


62580 


cggttctcgt 


aggggaatac 


ctgtctattt 


62640 


gcttgtgggt 


gagtaactca 


agtcctcctg 


62700 


tcaaggccac 


cttgccaagt 


tgtgcagctg 


62760 


tgtgggggat 


aggaaaccag 


ccagccctgg 


62820 


ggccggccca 


acagagtata 


gcaccacagg 


62880 


tctcacagtt 


ctggcggctg 


aaagtctgcg 


62940 


ggtctctctc 


cttggcttgc 


agaaggtcgt 


63000 


ccatgcctgt 


gccctagtct 


cctcttccta 


63060 


ccccctgatg 


acctcattta 


acctgaatga 


63120 


tcagattctg 


gagcactaca 


gattagggct 


63180 


ggtgtggaca 


caattcagcc 


cataatatca 


63240 
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agctacaacc ccgcaaggtg 
tttttttttt ttttttgaga 
5 caatctcggc tcaccgcaac 

ccctagtagc tgggattaca 
tagtgacaag atttcaccac 

10 

cacttgcctc agcctcccaa 
aagggatgtg ttttcttaat 
15 tagcccctgg cattgttgga 

gcaaaggtga agagtggctt 
ggggactcaa tctagatctg 

20 

tatgctaaat gtgtattatt 
gatccgtgac tccataaagt 
25 tttatggttc ctttgcagtg 
cacacacata caaacaaaca 
ctctctcctt ctcctcccct 

30 

cttcctcctg actcacacac 
aactccagag taacctaggt 
35 tagcctagag ctatctccat 

tctagcgagg ctctgcacac 
ctcacaagat acctgttcct 

40 

ctagcatcac ggccacccag 
attccaccaa gtgccagtga 
45 ccacgcctcc tcccaccagg 

ctctgcaggg cagttgtgct 
ccaaagatca tcaatatcct 

50 

gactgagagg gataccaaaa 
ggtgtgagcc tgagatttgt 
55 cagaaacagc tggaataagt 

tctgccgtgg gtgggaggga 
ctgtctggat ttgattctca 

60 

aaactccgct ggatgatgtt 



acatcccctc 


aaaagggatg 


tattttcttt 


tggagtctca 


ctctgtcgcc 


caagctggag 


ctccccctcc 


cggattcaag 


caattctcct 


gacgcgcacc 


accacgcccg 


gctaattttt 


gttggccagg 


ctggtctcaa 


actcctgacc 


agtgctggga 


ttacaggtgt 


gagccaccac 


tcacaaagaa 


aacatccaca 


tgtaaaaaat 


cataggccca 


caagcctgag 


ctccaggaga 


cccagctgta 


gaacagcaag 


ggtttgagtg 


cagatgggct 


ttgggggctc 


cattttggca 


tttctggaag 


ggtccatagc 


tttcatcaga 


gttaagaact 


acagtccttc 


ttccacatca 


tccccctgcc 


cacacacaca 


cacacacaca 


cacatacaca 


cacacacaca 


tacacacaca 


ctccaccttc 


ctttctcctt 


tctctccctt 


ctacatgtga 


cactaatgca 


actttcctca 


gccagcccaa 


gtccctctac 


cttataagag 


ccccgacagc 


cacacaagtt 


ctgcaaatga 


acccaaactc 


agaactcagc 


ccctttccct 


gtatctctga 


ccttcgtgga 


atgtggggct 


gcctaccatt 


gagcacacaa 


tgcctaccac 


gcagcaggca 


gttgtcctgg 


catctcaaat 


ctgctgagtc 


ccttcagccc 


tcatgacctt 


ctttgctgcc 


ttcttgatcc 


caggactgtc 


cttctgaaac 


aaaggctcat 


ctcctctggc 


gtccagccct 


tcctccccat 


caaatctcta 


ttttccactc 


ataggagaat 


atggccttgg 


tttcactgta 


aataactatg 


tccactctat 


qtatqqaqtq 


aqqqttaqqa 


acatggagtc 


aatgcatgtg 


ccatagagtg 


gaaacactta 


gtgtgagttg 


ttcaaaatct 


ctaagcctca 
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ttactttttt 63300 

cgcagtggtg 63 3 60 

gcctcagcct 63 42 0 

gtatttttag 63480 

tcaggtgatc 63540 

acccggccaa 63 60 0 

acactgggaa 63660 

aaaacactca 63720 

cttgagcact 63780 

aaatacctaa 63840 

ttatcagagt 63900 

gtttattacc 63 960 

cacatacaca 64 02 0 

cactctctct 64080 

ctgttacagt 64140 

taaatggcac 642 00 

ttaaataaaa 642 60 

acagggctcc 64 32 0 

cccacaacca 64380 

atggagaggg 64440 

ggagcactgg 64 50 0 

gcctcactga 64 560 

cctccttttg 64620 

tttctatttc 64680 

acctcagaag 64740 

taaacagaag 64 800 

ccttggaagg 64 860 

ggcacatgca 64 92 0 

tgaaatcaag 64 98 0 

tttacaatga 65040 

gtctcctcat 65100 
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tgataaaatg gcaagagtaa tactgcttga 
ttgaacaagt acttcatctg ttggggggag 

5 

cttcttttgg ggccgttgca aatgttctaa 
ataaatatac taaaaataat tgcataatat 
10 aattatatgt caatgaagcc ttttctaaaa 

agctcctatt aaatatagct catgccctct 
tgatggcagg caatcccatc agccacagtg 

15 

cgcaattgac a tea tea tec tegtatttga 
ctgagtaact aaaagttatt tgtccagata 
20 ctatctaaaa gcagcagaga agacgctggt 

aaacctaact gaatcttgat ataaagattg 
aaaatacaag gaaataatcc aacatttgag 

25 

ccagatgtag ggcataagtt gtctcatgac 
ccattttgta gatgaggaaa taaagactta 
30 gtttctagee agaatttaaa accaggtttg 

ccagtgcctt ttcaggcagt gcaaaacagc 
tggttccagc aaaggtaccc actgtgtgtg 

35 

ccctagcccc tgacctcttt ctaaacccac 
tctctctccg tggcagcaga aaatcageca 
40 agaccagctc ttccaactgg agacctatag 

catacgattc ctcccccagc cttgcaccta 
ttcccgctca tacagactgg ccacaggagc 

45 

caagaggatg cagaagtgag aaggggcaat 
ggagagegag agagagagag agagagagag 
50 gagaggcaca gagctcccag gggcccaggg 

acttccagta cagagaagag gaetcegggg 
ctaaggaatc cagccaacag aactctgata 

55 

agcaacagac caagaaatac cttcatttgt 
gaccaggtcc ctacaaccag atgagatcta 
60 gtggaagcgt atetttctge acatggccca 



tgggccacca 


tgtggttaaa 


tgaggegagg 


65160 


ggcggtgatg 


ggttgctagt 


cggtacgtgg 


65220 


cttacattgt 


agtgatgatt 


gcacaacttt 


65280 


actttaaatg 


ggtgaattat 


atgatacatg 


65340 


aagcattttc 


tatagtgece 


cacaaatggt 


65400 


cataccactc 


ttgctagccc 


actgggcaaa 


65460 


ectgeggaat 


accatgttgg 


atgaaatgea 


65520 


tgttcagcat 


catagegaca 


caatgtttat 


65580 


tgcaggcagg 


caaggattga 


tgtcaggctt 


65640 


aaccccttcc 


ttctcccctc 


cagagccccc 


65700 


gaaaacaata 


gggtgtactt 


gtgggaaaat 


65760 


catctactaa 


gtgtaaagat 


atactaaatg 


65820 


aatgacactt 


gggaagagct 


ataaattttt 


65880 


gagaagttaa 


taacttaccc 


aaggtcaaga 


65940 


ctggatccta 


gaatatatat 


ttatcccacc 


66000 


tcattttttc 


tgttgcccat 


gttgagctga 


66060 


accagaatca 


cagaagectg 


gggaagggca 


66120 


tctgttctgt 


accagtttat 


catttcagga 


66180 


tattcttatt 


gcctggtaag 


ttacattctg 


66240 


aatggtgtca 


caaaggggaa 


aataaggtca 


66300 


ctataaccag 


acacaeggtg 


ctcacacaca 


66360 


ctggccctgg 


agtcacatgt 


ggaaaagtcc 


66420 


acagtcacct 


tagttaaggc 


atcagagtat 


66480 


ataacagctt 


tcctaagaag 


ccctgccaca 


66540 


ctgccctcta 


gcagaccagg 


agecaagagt 


66600 


aagecgaagg 


aggctcagtc 


actcgctttc 


66660 


atcaacatgg 


acaaagcttt 


gtgctcccct 


66720 


ctcataatca 


ggttacgaag 


ggtaaagggt 


66780 


tttgaaagtc 


ttcaaaagaa 


agctcaaaat 


66840 


gaatcatgga 


gcatctgagc 


ttatcacatc 


66900 
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ccaagacact 


tgggctaaga 


ttctatttat 


cctatgggac 


aaataggtca 


agtgtttccc 


66960 


agagattatt 


cttttttctt 


tttctttttc 


tttttttttt 


tttttttttt 


tttgtgtgtg 


67020 


tgtgtgagac 


agagtctcac 


tgtgtccccc 


aggctggaat 


gcagtggttt 


gatctcaact 


67080 


cactgcagca 


tctgcctcct 


gggttcaagc 


aattctcctg 


cctcagcctc 


cctagtagct 


67140 


gggattacag 


gtgcacacca 


ccatgcccag 


ctaattattg 


tattttttgt 


attttttggt 


67200 


attttggtag 


agacggagtt 


ttgccatttt 


ggccaggctg 


gtcttgaact 


cctgacctca 


67260 


ggtgatccac 


ctgcctcagc 


ctcccaaagt 


gctgggatta 


cagatgtgag 


catccgcgtc 


67320 


cagccccaaa 


gattattctt 


gagaacactt 


ctcccatgat 


atgctcaatg 


aagatagtta 


67380 


catgaacttg 


ggaaaatctc 


tatgtattgt 


ttcctctctg 


aaagatctaa 


agtcctcatt 


67440 


aaaggctctg 


agaagtccca 


ccacgatgat 


ctgtttacct 


ttgcccaacc 


cagctatttt 


67500 


caaagctaac 


ttaccccaaa 


acctttcttg 


cacagaacac 


ccctttactt 


gcagtggaaa 


67560 


tagggttcct 


gggattcaca 


ttaagaaaca 


ctttttgatg 


aaaagagtta 


tagagatgga 


67620 


tggtggcaat 


ggttgcacag 


cattgtagat 


gtgtttaatg 


ctactgaact 


gtaacacttg 


67680 


aaaatgatta 


ggatgttaaa 


ttttatgtta 


catatatgtt 


aacacaattt 


tagaaattgg 


67740 


ggggaaaaag 


aaacacttgt 


tgggtctcca 


ttcatctcag 


ctataacagc 


ttttctcgcc 


67800 


catcctgacc 


ctctgtgtct 


tcttcagctg 


tctggcagtt 


tgggcttgaa 


catcgtcagt 


67860 


gcaatctgct 


ctgcagttgg 


agtcatactc 


ttcatcacag 


atctaagtat 


tccccaccca 


67920 


tatgcctacc 


ccgactatta 


tccttacgcc 


tggggtgtgg 


tgagtatccc 


tctcaaccaa 


67980 


agatcctcta 


agttctgaat 


tagctacatt 


tagaaaactc 


ccagaaaggg 


cttgacaacc 


68040 


aagctacctc 


actgaacatg 


gtaggaaaag 


gaggtgcttt 


cagcatagga 


acaaatttta 


68100 


aagtccttga 


cccatagctt 


taagcacatg 


acctttccca 


gtttgggttt 


ctgcctttgt 


68160 


agaacaatac 


ctaccttgtg 


gcaatgttga 


gagaattaaa 


tgcaatttta 


caaagaatgc 


68220 


atatgtgccc 


accaaccagc 


acctctcctc 


acattctctg 


ccttcttgcc 


cattactata 


68280 


agccagaggt 


tagcaaactt 


attctctaaa 


gagccaatag 


taagcagtta 


aggttttgca 


68340 


ggcctacagt 


ctctgtcatg 


tagctatagc 


tgccttacag 


cagccacaga 


aagcactaaa 


68400 


caaatgaata 


tggctgtgtt 


ctgataaaac 


tttatttaca 


aaacaggcag 


caggcaggat 


68460 


ttgacccatg 


gatcataatt 


cacccacctc 


tgctatacac 


tagcactgtc 


caatagaaat 


68520 


atgcaagcca 


catgttattt 


taaattttct 


agttgccaca 


tttaaaagag 


agagagagtg 


68580 


aaagcaacag 


gtgaaattga 


ttttaataat 


ctattttatt 


taaccccgta 


tatctaaaat 


68640 


attgtcattt 


caacatgtaa 


tcagtataaa 


aacttggttt 


gctattttgc 


atttggggaa 


68700 


ctaaatcttt 


gaaatccagt 


ttatattttg 


tgctatggca 


catctcgatt 


tgggaccagc 


68760 
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tacatctcaa gtgtttaaaa gccacatggg aattgtggtt gctgaataga acagggcagc 68820 

tacagatgag ctacctgtgt cccacgtaaa ttcaatcctc tgtctgcgca actgagtccc 68880 

5 

tcctctccta cctgctcaag ttctcctccc ctctccctca tcccgtctct cctctacatc 68940 

atcagtcttt ccccctttgc tggggctttt gcatcagcat taagcatgct tttgtttctt 69000 

10 aaaaaacaaa ccaacccacc tcctcttcgt tgtacttcct ttatccacta ccactgtatc 69060 

tctctggccc tctttgcaga caaattctta aaagaattgt ctacacttta gtgagtccaa 69120 

tcctcttctc tcatattctc tctgaacctg ctccaattag actttcagcc tcacttccct 69180 

15 

aaacacctgc tcttctccaa atccccaatg gggtgattga tccttcttct cggagggact 69240 

ttgcttgact tccaggctcc accctcactg gttgccctct tctctgtctc ctctgctagt 69300 

20 tctgtttttc ccaatctctt aggctcaata attggttctc tggttttctt tctgtactca 69360 

tttgtctatc tcatttagtc tggcaacttg aaacaccttg tctatgacaa ttgttctaga 69420 

atacatggct ccagccaaat cttctcccct atacaccagt ctagcatgtc caatggtcaa 69480 

25 

ctcaatagct catagttcat acatccaagt ctgatctcca gatttcactt cccaaacctt 69540 

ctctactcag ttttcctgat tccagttgat ggcaactcca tccttccagt tgctcgggac 69600 

30 aaaaccttgg agtcgtcatt gaccctgtct ttctctcaca cctcacagcc aatccctcgg 69660 

gcaatcctgt tggttctact ttcaaaatgc attcagaatc ccaccatttt cccccagccc 69720 

ctctactccc acgctggcct gagccaccgc atctctcatc taatcatgga aacagcctcc 69780 

35 

tgactgttcc ctttctccta ctctgggctc actgcagctg tttatagcat ggccaccaga 69840 

gggatccttt ggaaacacaa gatcccatca cttctctgct ggcatcttac tcagaaaaaa 69900 

40 agccaatgtc cttccagcag cctgaagatg ctggccctct ggtgcccccc gcccctggct 69960 

ctgctgccat cctctggttc tgctgctgaa ccacgggaac ctccatgatg ttttctccaa 7 0 020 

cactccaggc atctttccac ctttgggcct ggaatgttcc ctccgactgg ggaactcttc 7008 0 

45 

cctcagatgt tcccacagct tcctctgtcc cttcctcacc tccttcagat cactaatctg 70140 

caaccacctt cccaatgagg cctacctgaa caacccaaca cccacactct cccagtaacc 70200 

50 cgcttccctt ccccttatct atctttcctg tagtctatgc atctaacacc ctctgcaact 70260 

gacttatttg ttacatttca tgcttactgc ctgtccctag aatgagattc ttaaggacag 70320 

ggatgttttt ggtttcactc actgtcatac ctcaaatacc tagaactgtg tctggcacac 70380 

55 

ggttgacact tcataagcat ttgtcaagga ataaatgaat gtgccagctg tgatgcatac 70440 

tcagtaaaca tcaacagcca ttaactgttg tttgtatttg accagatcct tttatcttca 70500 

60 agagacttaa agacttatct ttccaagatt cttaggcaaa gcctctccct agagatccta 70560 
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tggaccaagt 


ccatgtcatt 


tatcaaaagt 


accttgagtt 


tctcccctcc 


cagaaacatt 


70620 




ttatggctcc 


cagatacttt 


aggattaaag 


tccaaattcc 


tcagccaggt 


atttgccctc 


70680 


5 


cctgggggat 


ctggctctat 


ctcactgtgc 


cctacagttc 


acctcccttc 


gaatctctca 


70740 




ctcaccctga 


accccagcta 


ctcccgtcca 


ctgcgagaga 


caggcacaag 


cgagtacttg 


70800 


10 


ggaattttcc 


tgctgggaaa 


gaaccaatca 


gggaggcatg 


gactgcccaa 


tagaagaaat 


70860 


taagaagcaa 


atgctggtgg 


aggaactaag 


ggaaaggaag 


atgggagaag 


agtttaggaa 


70920 




agactaaacc 


ccaagcagca 


cagccaggtg 


ccctggagta 


gagggagcag 


caatttttga 


70980 


15 


ttcagaaaac 


ttggctttta 


atcttagttc 


tgccaatgat 


tgatcttgga 


atgagaactc 


71040 




gccccatgcc 


caccacccat 


tccaccttca 


agcctcagtc 


atcccatctg 


aacatggagg 


71100 


20 


gtcggactgc 


atctgagctt 


tgcaaataat 


gacccaagga 


atggagatac 


tccggggttt 


71160 


ccacatcagg 


aattattttt 


catcaaataa 


aaaaaatctc 


agctgcttga 


gggacccctt 


71220 




ttccatgaaa 


tattttatgt 


gtaaaaaaat 


tttatttaaa 


aaaaattgaa 


aaagaccagc 


71280 


25 


atactttgcc 


caggccattc 


tactctaagt 


ctctgtgatc 


cgttgactta 


gacatcctgc 


71340 




actccccttt 


tcctgggaca 


actgcctcta 


ccctcaccct 


ctctaaatcc 


cttcatgaca 


71400 


30 


atgaacctct 


ccctcctcag 


tgctctggga 


tacctttttg 


ctgttctgta 


gctcttaccc 


71460 


tatctgatgt 


gaattataat 


tatcatctgc 


atctgttttc 


ctactagatt 


gtgcctctca 


71520 




ggatgcattt 


cttagccgct 


tctaccaagc 


agatgtttaa 


taaccacttg 


ctaaatggag 


71580 


35 


aggaaggaga 


agaaaagggg 


ctaaggagag 


gcctctccag 


aagagagaat 


tctccaagca 


71640 




tctggctgtc 


accaatgctg 


tttttaaaaa 


tcataattga 


gataattgag 


gccggtcacg 


71700 


40 


gtggctcaca 


cctgtaatcc 


cagcactttg 


ggagaccgag 


gcaggtggat 


cacttgagct 


71760 


caggagttca 


agaccagcct 


gggcaacatg 


gcgaaatccc 


atctatacaa 


aatatacaaa 


71820 




aattagccag 


gcatggtagc 


gcgtgcctgt 


aatcccagct 


actcgggagg 


ctgaggcagg 


71880 


45 


agaatcgctt 


caacccagga 


ggcggagctc 


acaatgctga 


gctcatgcca 


ctgctctcca 


71940 




acctgagtgg 


cagagcaaca 


ctctgtctca 


aaaaaaaaaa 


aaaattaaaa 


aatcataatt 


72000 


50 


gagaggcatt 


tctttggtac 


ctactgcatg 


ccatccacct 


tcactatgaa 


atgttctcat 


72060 


gtcacataat 


tggaacccaa 


atggtaatgg 


gagtctcctg 


gatggacctt 


ttagattgaa 


72120 




actcgcttcc 


cagatacact 


ggaagactcc 


atactcctgg 


gtattcttgg 


gtaacagcaa 


72180 


55 


atggagggtt 


tctatttctc 


ccggttacct 


cctgctatgg 


ttaatttact 


tcattttggg 


72240 




gatcctttca 


ctgactgttc 


ttcccaagcc 


agagaaacag 


ccccaattcc 


tgccctccac 


72300 


60 


aaaacaccac 


tgtcacaagc 


caagaaagtc 


ttttcagagg 


ctgacatttt 


cccaaaggtt 


72360 


cagatgaagg 


ggtgggttgc 


cccttcacac 


ctgtgggtgt 


ttctcgttag 


gtggaacaag 


72420 
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agacttggaa aagaaagaga cacagagata 
ggaccggcgt tcagcataca gaggatccac 

5 " "" 

tttattgatc attattgggc atgggtaata 
aacacgtgaa caaatgtctc tgcatcataa 
10 gatgtgcata tacataaaca tctcaatgcc 

cacctccagc cctaaggcag ttttccccta 
acaccgagac attccattgc ccagagacga 

15 

actgcaaaga gccgttcctt cctcttttac 
tgttgggctg ggggacggtc aggtctttcc 
20 catggggaga aaccttggaa aatacctggc 
gcagtgtttt gtgtctctgg gtacttgaga 
gatgactctt tttttttttt tttttttttt 

25 

ggccggactg cggactgcag tggcgcaatc 
tcatgccatt ctcctgcctc agcctcccga 
30 gcccggctaa ttttttgtat ttttagtaga 
tctcgatctc ctgacctcat gatccacccg 
gcgtgagcca ccgcgcccgg ccggtgatga 

35 

tgtttaacaa agcacaccct gcacaaccct 
catgtttcag ggagcacagg gttgggggta 
40 aagaattttt cttagtacag aacaaaatgg 

acacagtaac aatctgatct ctctttcttt 
acatcagagc ccagcaaacc actgattact 

45 

ttcatttttc acaggaggcc ttttcctgtg 
ccttcttttt cctctaacca tctccctgtt 
50 cctaccgacg gcctccccag agcagtgtca 

ctcaggtctc cacacccaaa gcaacctatc 
ttctcagagg aacactacca agtaaatgtt 

55 

atacttacaa aaacccggtg ggaaatttcc 
tccgcaaagc cacaaagagt ccatggcagg 
60 gtcccgtgct cctgtcctcc ccatctgctc 



aagtacagag aaagaaaaag ggggcctagg 72480 

gtgggcactg gcctctgagt tcccttagta 72 54 0 

ggataatagt ggagagaagg tcagaaggta 72 60 0 

acaaggtaaa gaaaaaagtg ctgtgctttt 72660 

ttaaggagca gtattgctgc cagcatgtcc 72 72 0 

tctcagtaaa tggaatatac aatggacttt 72780 

gcaggagaca gaagctttcc tcttacctca 72840 

taatcctcct cagcacagac cctttacggg 72900 

catcccatga ggccatattt cagactatca 72 96 0 

tttcctaggc agaggtctct gtggccttcc 73 02 0 

ttagggagtg gtttgagatt agggagtggt 73 080 

ttttgagacg gagtctcgct ctgtcgccca 73140 

tcggctcact gcaagctcca cttcccgggt 7320 0 

gtagctggga ctacaggcac ccgccaccgc 73260 

gacagggttt caccttgtta gccaggatgg 73 32 0 

cctcggcctc ccaaagtgct gggattacag 73380 

ctcttaacaa gcgtgctgcc ttcaagcatt 73440 

taatccatct aaccctgact tgacacagca 73500 

gggttacaga ttaacagcag ctcgaggcag 73560 

agtctcctat gtc tact tec ttctacacag 73 62 0 

tccccacatt cagagggatg gtggaggttt 73 68 0 

gaagataaga catccttgct ttgggaagtc 73740 

gcctccccct tcccaccatg gcgctcaact 73800 

ctctctgtgt cttgtccata ccctccaatt 73860 

ctgctcccct ctctgactcc agagcacacc 73920 

aaggggcttc cctaaacatg gtcatctttt 73980 

gcaataacaa ttcttttacc cccatacttg 74040 

ccccacgagg ctgagagagg ataggggact 74100 

gccccagtga gaggtcagtt cggactccca 7416 0 

tgectacaga accctggaat ggegatttet 74220 
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ggcgtgctgc tggtcttctg cctcctggag 
ggctgccagt tggtctgctg tcaatcaagc 
5 gaaagatgcc cccaaaaagg tggagacaag 
aggagcagga gtagttcctc ggtgtccgag 
atgccccctg tgtgcctgtg ttttccaggt 

10 

aaacccagtg atcaccccag aaccggtgac 
agcaaataag taaggctaca gattctggaa 
15 cctccctttc tgggcttcca taacccaggt 
tctcccactg tttgtactct caccttcatt 
tctctatcaa gaagaagaca gagattttaa 

20 

gcacatgcat cagcacatat gtgggcatcc 
gtgtgctctg ctgcatgtga gcttgtgggt 
25 tcactctttc aattgtgcat tcatttaata 

gctctcttca tggcaggagg gggctggaca 
acttctggcc ttaggagaga ggacaccagg 

30 

ggcaacatag gggagaggca gaaggagccc 
aaggaaggaa tgaggagagc agaggacagg 
35 cattaaagga cagtaaggat tttccgtgta 

agagagaatg atcagtcctt ttagattctg 
tgcaagaatg tagaaggaat cttccagaga 

40 

ggactaattc caaagagaaa tggccatgtc 
gtaacagcat ctaaaaggac taatcctttc 
45 ccaaacagcg tcgtccctca tactgaaaat 
tgtcgcttcc tgaaacacgc aaaatttaaa 
caagaagttg cattctgtgg ggttatagca 

50 

tttccttagc tacagaagca ctgcaaggat 
aatacaaaat gtgattttgc tcttgtaaat 
55 tcttctaaat ggcatgcagt tagaggtcta 
tctgatcagg aatccttctg atgcccatca 
ctagaactaa atgggcctta ggaattacag 

60 

aatatgaggc atgtaatatc ctggttctag 



tttggcatcg catgcgcatc ttcccacttt 74280 

aatgtgagtc ccagggttcc tcagtgggtg 74340 

ggagctcttt gtgctttcca ctgcctgctc 74400 

gcctttggtg catggcccgt ccttcttagc 74460 

gagtgtcatc tatccaaaca tctatgcagc 74520 

ctcaccacca agttattcca gtgagatcca 74580 

gcatctttca ctgggaccaa aagaagtcct 74640 

cgttcctgtt ctgacagctg aggaaacgtc 74700 

cttcaattca gtctaggaaa ccatgctgtt 74760 

acagatgtta accaagaggg actccctagg 74820 

agcctctggg gccttggc'ac acacacattc 74880 

tagaggaaca aatatctaga cattcaatct 74940 

aatagatact gagcattcaa tgtgttcaag 75000 

ttgacatgac acagtcttgg acctcaagaa 75 060 

acatcacggg agcaattttc actgacaaag 75120 

agagtgaatt tattctggct gtgatgcaga 7518 0 

tctggttggg gtagcaacat ctgaatgggg 7 5240 

ataagggacc acgtggttca ggaggccttc 75300 

ttaccaatga ggcaaccaaa ttcacgaatg 75360 

cataaggcca atacatcatt atctgtaaaa .75420 

attcctatta ccatttctgc atttaccgtg 75480 

agcagacgac tcactgtctc caaaggtctc 75 54 0 

ccctcactgt cctttaatgg cccatttaga 75600 

cattgttatg tatgataacc aatgggcatc 75 660 

atcatctcta aaatccagct gttgtcaagc 75720 

ttgtttgttt gttgtttgtt ttggttacta 75780 

aactttgttc ttggggttat ctaagaagtt 75840 

tcgcctgtgc acctgcatct tctcattttc 75900 

gctagccagg aacaggagtg ttttcatttc 75960 

acctcctgaa aaattgattt tttcaaatgg 76020 

atcagtcctt cacatacctg ttttgtaagc 76 08 0 



-317- 



WO 02/062946 

caggaggtta ggacattgag ctccccaaaa 
ggctcggtaa acagagcaag cctgcccctc 

5 

gctgccccat ctaactccac acctccttct 
accctccatt ctgttcacag agcctccttc 
10 atattgctaa catagctcgc ttcctgcagt 

tacatctggt gcctctccat atgggggcac 
gcttgttgga acagattctt tcctatctta 

15 

cccttgtcct ggggggacca attgccaacc 
tgcattctct gcaccacccc acaagccctc 
20 actccataac aatggaacat aaaacccgga 
ctatttaacc atagcactgt ggcagtgcaa 
tgcaatttaa ttgccattgt aatggtattt 

25 

cataaggcct ctgccctcat gaatggagta 
cacaggagtg ggcttctgat agaaggataa 
30 gtgctcattt gcccttcagc catgttatga 

catgctcttg gacttcccag cctccagaac 
ttacccagtc tctggtattc tgttatagca 

35 

gaaatttgcc caaggtcatc aacccagcta 
cagatggccg gttcagggct ccttgcagaa 
40 tgactaagat taaaacactg tcttcacttc 

gtgccacttc accagccaga gacttccgca 
atgactgctg agctggctcc tcccacttgg 

45 " 

agcccggatc tcacacccac catgggcaag 
caagtgagtg agagtggggt ctggccatgg 
50 ggcagctcca ggggccagca caggcacaag 
tgccttaagt ggcttccacc ttgagcactg 
gaacacttag agatgccagc aactgcagag 

55 

tctggctcag ggagtaccaa gatctgggct 
tccggcaaac gagagcatgt caccacccat 
60 agcttggttg ttttacagct tgtttggtcc 
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gggacccacc 


tgtttgcacc 


gttctctgag 


76140 


tcaccacctc 


ctttcttatt 


gctgcctctt 


76200 


catcctacag 


aactctaccc 


ctcaaacctg 


76260 


ctgacatttc 


tgacctatga 


gttaattcag 


76320 


gtgggtcatc 


ccagagaaag 


cactttctaa 


76380 


ctgaccctgc 


cctcctgaaa 


tcaccacagg 


76440 


tcacctacta 


tattagtcct 


gtgttctagg 


76500 


tagggtgaca 


gctcattgca 


taaaatgtta 


76560 


cacccacagg 


gctccatccc 


ctccctatcc 


76620 


agagatctca 


gagatctgta 


gcccacccaa 


76680 


tggtttaaat 


ctcccctcca 


aaactcatgt 


76740 


aaaagtggag 


tctttaaaag 


atgattagat 


76800 


atgccattat 


tttgggagtg 


ggttagttat 


76860 


attttggtcc 


ttgtcctctc 


tccgcctcat 


76920 


tgcatcaggg 


ccctcagcag 


atactggcac 


76980 


tgtgagaaat 


aaatttattt 


tctttatgaa 


77040 


atgtaaaaca 


gactaagaca 


ggtagagatg 


77100 


gaggtatagc 


caggtccaga 


accaggactt 


77160 


ggaggagcat 


gtggtaaaaa 


aaaatcaggg 


77220 


cttagcttga 


aatgacacag 


gatccttcgg 


77280 


gccagcagca 


cctctgctag 


agttttgctc 


77340 


cccagcaggc 


tgcgctcagc 


tcatgctacc 


77400 


ttaggcatgg 


agcagaaagg 


ggtgtgtgag 


77460 


tacacctgtg 


ctggctgctg 


tggtggggca 


77520 


ctctgtgcaa 


ggctatggct 


ggacaaggca 


77580 


gcatttggac 


aaggggaaca 


catggtgcct 


77640 


ccccaaatgg 


gtgttacagc 


atgtcacagc 


77700 


cccataaggg 


tcacagctct 


tctctcatag 


77760 


agctcggcaa 


tctggccaga 


aacgtgtttc 


77820 


tgctgccctg 


ctccagccca 


cagttcctgg 


77880 
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gctggcctgg ccctgccctg cttcccatca 
gcaggaggga tacaatgtta cagctccttt 
5 ttgtcctgtg tccaggaaga atgaggttac 

aagagtttta ctgagcaata aaacagctct 
cctacccaga ggtgggtagt cccacgtggc 

10 

cccagactta ctgaattagc actctgggga 
ctgctgggtg attctgatgc acactctcag 
15 tcaaatgact gtttttctca ttttgagtac 

tgtttacaga ggcagcccat gccatctgta 
atgaactatg tcaaaacgga atgaattaca 

20 

aaaagatgac ccttcacctt tagacttgcg 
agcccaacca cttgctgaca gtctctggta 
25 ccttctcccc cttctcaccc cagcaaggct 

agttaaatca acaaccagga gtcaatacaa 
aaagcagcaa cattcgactt cacccaggta 

30 

ccagtgggcc aatccacaga ctgccagaag 
ctactgaaca ctgccagcgt gctcgcacgg 
35 atgaacacaa tcttatgaaa atggaagtaa 

aacttattct gttcaatgcc ctttgataaa 
gagtttaact gagcaatgaa caattcacaa 

40 

ctcagagact ccatgagtgc ctcgtgttca 
atgtactgga attggaagtg aggtacagaa 
45 ccttagttga aggcagcttg aatagtcagt 

ggattggcca atgctcagct attgttactg 
tgtctgacta ttaagctagg ttacagttca 

50 

gtccttctca ggccatattt agtttgcttt 
tattgcaaaa atacagagtg gcattatcta 
55 ggggctcaga gaggttaagt aatttgccca 

ggtcttaatt catgtctcct cactccaaat 
ctctcacagt tctggaggct gggaagccca 

60 

gtgaggggct gcttcctggt tcgtagacag 
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tgcagagcag 


ctgcccagcg 


tcagtagaag 


77940 


cgcagctgcc 


attccacggg 


tgctgggttt 


78000 


atggacaacc 


agagagtgag 


caaggcagag 


78060 


tagtggagaa 


gggccccaaa 


gcaggtagcc 


78120 


tgagtccaga 


ggttttatgg 


gctttacgat 


78180 


caggatccag 


caatctgttt 


ttgaacaagc 


78240 


gtttgagaac 


cactgctcca 


cagcatcttc 


78300 


cttctaacgt 


gatggcatac 


attatgatgg 


78360 


gacaattgta 


ctgagaggct 


cttcttaaga 


78420 


tcaagttttc 


atgtggaaga 


ttaaaaaaaa 


78480 


gaatcagagt 


attagataag 


actctgtctc 


78540 


ggttacgtcc 


cccatgttcc 


ctttggtcaa 


78600 


agtatcaggt 


acacttttta 


tgtctcataa 


78660 


actatataga 


tacagagggc 


aataaaggca 


78720 


agagggtaat 


aaaattccat 


aaaggacaac 


78780 


catcaggcaa 


ctgagcactc 


cccacctcca 


78840 


gactggtagt 


gctcaataaa 


tgtctgtcaa 


78900 


ttaacattta 


ttgagtcttt 


aatgtctgac 


78960 


agaaaaacct 


cagccgaatt 


aaatttaaag 


79020 


atcgggaagc 


ccccagaatc 


acagcagatt 


79080 


gaacaaattt 


atacacaaaa 


aggaaacaca 


79140 


acagtgagat 


tggttatagc 


tccgcatttg 


79200 


ggtctatgaa 


tggttgaagt 


atggccgctg 


79260 


atgcatacta 


ttaagttagg 


ttttcgattt 


79320 


tccacaagga 


ctcaaatata 


gaagtacaga 


79380 


aacacctttc 


agctgcatga 


tctcattttg 


79440 


tgctttagga 


aagagaaata 


tgaaaaatct 


79500 


agtctgcact 


gctagtaaat 


gatagagctg 


79560 


cccatgctct 


tttgccactg 


caaaatgtat 


79620 


agaccaaagt 


accagcagat 


ttggtgtctg 


79680 


cagtgctctc 


actatgtcct 


cacatggcag 


79740 
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aggggccaag ggagctctct ggagtctttt attatataaa gccactaatc ttattcatga 79800 

gcccctcatg acgtaatcac ctcccaaaaa acccacctcc aaatatcatt acattgagga 79860 

5 

ctaggtttta acataggaat tttaggatga caaaaacgtt cagtctatag cacagagtac 7992 0 

aaaaagctta ttgatacagc ttcagattcc acactgcaag taacctttcg ggaactaaca 79980 

10 cttgagttcg gtgtatcagg ggttcttaag aagcgggaag aggtgaattt ggtaacgtct 80040 

ggatgtattt tttgttgcca caactggagt tgagggaaag ggtgctattg gcatctagtg 80100 

ggtagaagcc aaggataatg ctgaacatcc tacaatgcac aggacagccc cttcatcaaa 80160 

15 

gaattgtcta gcccaaagtg gcattaatgt tgagggctaa gaaagaagac taccagtaat 80220 

tatctgaaaa ggctattaaa atactcactt ttctaactac cgatctgatt ttcttcatat 80280 

20 acttcaacca aaacaacata tcaaaacaga caatatataa gcagatgtga aaatctagct 80340 

ctcttccagt aatccaaaca ggaaagaggt atacatacct gatatggttc aggtcacaca 80400 

gccacaaaat gtagtgtcct agcaacggtg aaaacagcag aaacaggaaa gtcatgctca 80460 

25 

cctttcccta ccacttcttc cctgaagcaa ctcataagac cctaattcga aaggtaccct 8052 0 

cccaaccctt tgggaggctg aggcaggagg atcacctgag gccaggagtt cgaggctgca 80580 

30 gtgagctatg attgtatcac tacattacag cctagatgac agaatgggag ctcaaagaca 8064 0 

aaaaaaaaaa aaaaaagtag aaaagaaaag tgccctccct acaccgagag gaaaggaaca 80700 

tccttatctc tgaataaatg tggttacaaa atgatattaa acaaataggc cttgctaatt 80760 

35 

tcatcccagt ttattaccat taagtcatgg ttttttattt aatcatactt cacaattatt 8082 0 

cacgtcctca tcaaacctag cataaaaagt acacaagttt atctgtttct tcaggtcttt 80880 

40 atttccttat gaagaccatc acatcaagca aaacttatat taaataaatg tgtatgcttt 80940 

tctcttgtta atctgtcttt tgtcaaaagg cccttggcca taaatgtagt gatggttgaa 81000 

gaaaaggtat ttctcctctc ctatgaacct ataaaacaat accactcttc ccactaattt 81060 

45 

tttttcattt tggaaaatag aattattttt tatttaaaaa tatttatgtt aatatgtaat 81120 

aaatatttta ttttaaaata gattagtaaa tcatttttaa gtatctccat tttaatttct 81180 

50 aatatggtat aaatatcaac aaatttaacc cacataaaca agtgttcttc gagagtactt 81240 

aataactttt aaaaatgcaa aggactcctg agaccaaaga gtgtgggtcc tgagaccaga 81300 

gtttatgctg ttattctcct cctcaaagcc ctgcaagggt tcctagcatc cacagaataa 81360 

55 

attcctgacc atggcattca agaccctgtt cagctgccct tcaatccctc ttcccacaat 814 2 0 

gtccaaatcc cagttgtcca gcctctctgt caatggctcc taccttgaac tcaccaaatt 81480 

60 caggcatcca ttcactcatt catcaaatgc caagcacctg cctctgcagg taagaaccac 81540 
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atcccacccc ttaggagctc agaagctcat 
gtggggcaag gatgcaggag aggttccgac 
5 ctttgggagg ccaaggcagg tggatagtct 

ggtctctacc aaaagtaaaa aatgggtctg 
aacctgtaat cccagctacc tgggaggctc 

10 

ggaggttgca gtgaaccgag attgtgccac 
atgagggcga atgaagctcc ttctcaaaaa 
15 ctgtcaaggg acaaagcact tatttttttc 

tgtttttggc tcttttgggg tatgggaaat 
aggttgggga gggtaaaaca gagaattgtt 

20 

ttctgagggc agctctgcag gggccggagg 
gagtggagag gaagaggatg gggcctagaa 
25 aaagggccca gctgaagttc tctgtgacag 
ccagcacttg atgtttagtt cattgctctg 
tacactttag taaaattccg ctacacagaa 

30 

tcaaggaagg gagagttttc tagtgctgcc 
ttaattcaac gtatcgtctc acagatctgg 
35 agggtgggct ccttctgggg ctgtgaagga 
gtggtccgtg ggcatcttgg ccactccttg 
ttcatgttca tgcaggcttc ttcctgtgtg 

40 

aacaccagtc atagtaaatt agggcccctt 
ctccgaataa agtcacattc tgagatacta 
45 ttggagggtt ccatttcaac ccacagcagg 

gtggacaaac agccgtaaaa ggggagatgt 
gcaaggaaat tgactatgta aagcgtaatc 

50 

atggagctac tatatatagg atggcggctt 
tgattctata aggctggaaa aattggagac 
55 acatttgggg catgtgggtt ccaaagagat 

ttgtggagga agaggcttca ggatgagcgt 
cacacctctg gttcacactc ttctggaata 

60 

tctttctttc aatgtccaac ttgagttaga 
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aaggaaggat 


acacacatac 


tcggataaca 


81600 


aaaacgctat 


gggagcctgt 


aatcccagca 


81660 


taggtcagtc 


tggccgacat 


ggtgaaacct 


81720 


gtgtggtggc 


agccacctgt 


aatcccagct 


81780 


aggcaggaga 


atcgtttgaa 


cccagaaggc 


81840 


tgcactccag 


cctgggtgac 


atgagggtga 


81900 


aaaaaaggta 


tgagagtttg 


gagtaggatg 


81960 


aagtggcggc 


aggtttcctt 


aggaaataac 


82020 


gaagccctcc 


tcccatccca 


agaaatgacc 


82080 


ctgcacaata 


ttgcaggaag 


catctggaaa 


82140 


acgaggctct 


aggatgggaa 


gaaaaggaaa 


82200 


gcatggagga 


aggccagggt 


cctggagaag 


82260 


gaacgagcaa 


gggactaaga 


tcagagacat 


82320 


ttttggggca 


ttggcatctt 


tcccaccccc 


82380 


ccactttgtg 


tgagtaaagc 


tgtaggaaaa 


82440 


ataacaaagg 


actacaaact 


gggtggcttg 


82500 


agactggaag 


tctgagatca 


aggtgtcagc 


82560 


ctccattcca 


gacctccctg 


caggcttctg 


82620 


gcctatagat 


gcatcactcc 


agagtctgcc 


82680 


caagtctgtg 


caatatcccc 


tttttacaaa 


82740 


ttaacctaat 


tacttttgcc 


atgaaactat 


82800 


gatgaggggt 


taagacttca 


gtatatcttt 


82860 


agatgtgttc 


caggactgag 


ttcatgtggg 


82920 


gagggagatc 


aggagtgtat 


aggaaattga 


82980 


tacatgaaga 


aatcctccaa 


cttattggga 


83040 


tagccagttc 


atgtaaacta 


aaggaaaggg 


83100 


acacacacac 


acaaacatca 


caggcagctc 


83160 


ttagtttgac 


tgtaaagttc 


agggaaggct 


83220 


tggaggatgg 


ggtgaatctg 


gacatgctcc 


83280 


cactgtgcct 


tcagtgcagc 


tttgatccag 


83340 


tgtccaattc 


catgtccagc 


cactcgacaa 


83400 
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acctcctagg accagcccat ccctttgaca 

tgactgaata tttgtttgtt cctatccata 

5 

tattctgtct cgtcaactcc ttgcaaattc 

agcacacacc agacactcaa taaatgctca 

10 cagaaaatct tgacccactt ctggatttct 

atcccatgga atcctacttt agccaaggcc 

tggatgccag agaatagacg tattgactcc 

15 

tgaaagggaa acctaatttg agtttatttt 

cat cage tag caccagaatt tctagatcag 

20 agcatccaaa gacttcccag gagcacaagg 

ctgacagagc ctggatagca ccagcagaga 

tcttgagaac cacaaggaat caggggttca 

25 

gcgtgcagcc acccatcctt ggaggtgctg 

caggttctca aattgagctc ctgtctctgc 

30 ggagactctg ctaggcattt tctggctgcc 

gtgatcaggt ctacttagga cctcaatcta 

ctcttttttt tcttcttctt ctttttgaga 

35 

tgcagtggcg cgatctcggc teactgeaag 

gcctcagctt ccagagtagc tgggactaca 

40 tgtatttttt agtagagaca gggtttcacc 

cctcgtgatc tgcccgcctt ggcctcccaa 

tttgttcctg acattttcct gaegggaggt 

45 

atagggecca actgtttgtg ccaattgtca 

aaagaaggag tcctctgaaa aaaaggaaga 

50 gcagagtata acatttagca cagtattcat 

ttcttgtatt tcaaatctca acttcagcat 

ttaacctctc tgagcctcag agagtttgee 

55 

tgggggaact gecttatate tacagggtta 

tgttattcta aaccactctc ctgggaaatc 

60 aggtctaatg cagaagtggc tcacactgcc 
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catctcccag 


ccacatcatc 


cctgatggtc 


83460 


ettgeactag 


agttatccca 


acagtggtta 


83520 


tagagcacct 


cgtatcctag 


cataccacct 


83580 


etaattgett 


ataagggaaa 


tctattggct 


83640 


cactttctca 


aagagaaact 


ttggaaggag 


83700 


ataaatactc 


tgaggggcaa 


ataaacagcg 


83760 


aaatgaaact 


gcaaagagga 


ctttggagct 


83820 


ctagatgatt 


aacattaaat 


agtgccaatc 


83880 


getgeagaaa 


cccacagagc 


ctgaagaaag 


83940 


taagacttct 


gtgaattctt 


ccgcaagact 


84000 


cgtggaccta 


ggcggtccct 


cggaagtgtc 


84060 


gacatagggt 


agtgccctcc 


agcaagacag 


84120 


ggaggctggg 


aagtaactga 


agagtcacaa 


84180 


atggacacag 


caatcttaag 


atagaaggct 


84240 


ttgctcccta 


agcccagctg 


taggcacact 


84300 


gaactggtgg 


aatggcttct 


gatggaacca 


84360 


eggagtcteg 


ctctgtcacc 


caggctggag 


84420 


ttccacctcc 


eggtttcagg 


ccattctcct 


84480 


ggcgcccgcc 


accacgcccg 


gctaattttt 


84540 


gtgttaacca 


ggatggtctc 


gatctcctga 


84600 


agtgctggga 


ttacaggcag 


aaccattctc 


84660 


agaatctcag 


gggtcccagg 


aacatccctc 


8472 0 


ggtctaegge 


tgtattgttt 


ttcacacatt 


84780 


gagaaatgat 


taatttttta 


atatcatgaa 


84840 


tatgaaaatt 


gaaggcaggc 


agactatcaa 


84900 


ttcctagctg 


tgtgactatg 


gtcaaggcac 


84960 


cgagaaatgg 


tcataacaat 


aatatattgc 


85020 


tgaggcatca 


acatccaagg 


atgctgtagt 


85080 


tgttattctt 


caagaaagca 


aggaatgtca 


85140 


tgcacagcct 


ccattcatgc 


agctgaatgt 


85200 
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tccaccatcc 


atatgtcact 


ggtagaacat 




gcaaatagca 


gtcagaaaaa 


aataaatttg 


5 


attgtagaca 


agttattcca 


ctttcagagc 




tcggatcaaa 


atgagtgaga 


agacaggggt 


10 


tgacagttgt 


ttttcctatg 


gaagagtagc 




tttaaaataa 


ggaaaaagct 


gggaattttc 




gttgggatcc 


aaggaaaaag 


aagtgaaacc 


15 


ggaagtggtt 


ctcaaacttc 


agcagcatga 




atggctgatc 


cttgcccctg 


gactttctag 


20 


tttccgacaa 


gttcccggtg 


atactgatac 




tctgctatta 


atactaaatc 


taatagtttg 




ggcccctctc 


ttttttaaga 


ttggggtaac 


25 


tattttattt 


tattttattt 


tattttattt 




acgtgcaggc 


ttgttacata 


ggtatacatg 


30 


cgtcatctag 


gttttaagcc 


ccacatgcat 




ttgcccccca 


accctccgac 


aggccccagt 




ttctcattgt 


ttaactccca 


cttatgtttg 


35 


atgatggttt 


ccagcttcat 


ccacgtccct 




gctgcatagt 


attccatggt 


gtatatgtgc 


40 


gggcatttgg 


gttggttcca 


agactttgct 




ctacctttca 


acaaagaaca 


aatgttaact 




gttttctcaa 


tcttcaaaac 


accctttgga 


45 


ataagaaact 


gacagcgtat 


gcaacttgct 




tgagaagtgg 


acttagggct 


gtctgagctc 


50 


tactttatag 


acgattaata 


ttaaataatg 




tgtccattga 


caatttggag 


tgtttctctc 




atttatactt 


gtgatttagt 


gaggacagaa 


55 


gcacacgtta 


ttttgtgcca 


taggatttgg 




gtaaagggat 


aagcccacca 


gccacaacaa 


60 


ctctcatctg 


aatgggatat 


cttcgcatct 




tgcctacctc 


ctcccattgc 


acacttgctc 
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ttcttctggg 


aaatgaaaac 


atagtgaaat 


85260 


agtctcagat 


ctattcctgt 


gggttgcata 


85320 


cttggtctcc 


ctactttgca 


aagaggcggg 


85380 


gaatgcctta 


gccatcaggt 


gtcagaccag 


85440 


atggaaaggg 


gcctcagaga 


ccagctagat 


85500 


tgaagcttta 


tataaccaat 


attttgaact 


85560 


acaaaggaga 


agtgtagggc 


atttctgcat 


85620 


gattcacctg 


gagggtgtgt 


tgaaacacag 


85680 


tagctctggg 


gaggggacca 


agaatttgac 


85740 


tgctagtcca 


gagaccgtac 


tttgagatcc 


85800 


gggtcaatta 


ggaaaaatat 


tcacaacaat 


85860 


acttttattt 


tattttattt 


tattttattt 


85920 


ttactctaag 


ttctgggata 


catgtgcaga 


85980 


tgctatggtg 


gtttgctgca 


tctatcaacc 


86040 


taggtatttg 


tcctaatgct 


ctccctcccc 


86100 


gtgtaatgtt 


cccctccctg 


tgtccatgtg 


86160 


gttttctgtt 


cctgtgttag 


tttgctgaga 


86220 


gcaaaggaca 


tgaactcatt 


cttttttatg 


86280 


cacgttttct 


ttatccagtc 


tattgttgag 


86340 


aaacaatggc 


ccttctctta 


tgggtggtct 


86400 


gttccctctg 


tgccaggttc 


tatgctggca 


86460 


gaaggtgcca 


ctattatgca 


ccttttatag 


86520 


caaggtcgta 


caactaataa 


atgaaggaac 


86580 


ttaaccctga 


tactgtcttg 


ctttcgagtt 


86640 


ccaattcatc 


agctagcacc 


aaactcaaaa 


86700 


actcaggatg 


cagctgggac 


agggggatca 


86760 


agcctcctga 


gaacatacct 


ctggggtcct 


86820 


tgtcttgcca 


gtaaggactg 


tataatctaa 


86880 


ttaacaagta 


aagtacacta 


gcaacacca z 


o c qa n 


aataatatct 


ttaatgttct 


gagggctaca 


87000 


cccagcatca 


aaaaattaag 


caattctcta 


87060 
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agtccagctt atcagtcctc atcaaatcgg gacaatatcc caggtctccc cccaagtgat 87120 

tcccttctca actttttcta atgctcaaga ttcagtagaa actgcaggct attaagggtt 87180 

5 

tgatcactgt cataatctct aggaccctgg accataatgg tttaaacaca aggaaaatgc 87240 

aacaaaattt ctacatggtc gcaatcccaa caccaacatg gacaagaccc atgaagagat 87300 

10 tgaaagggat gatcattccc aggccacttg gcccatggtt cctcctggat gtcaagggca 87360 

cccaagaagc ctgggaaaga cagtctgttt ttttgctctg aaaccaatta atgttgaaaa 87420 

atcatacaca aaatgcctta attaaattaa acccataaat gagttaacta gttatagact 87480 

15 

tcaaggacag aagtacagct ggggcaaatg agaacctaga accgagagtc aaaaggacat 8754 0 

caaagaccaa ggtagtatct gtgtgtatgt gtgtatgtat gtgtgtgtgt gtgtgtgtgt 87600 

20 gtgtgtatct gtgtgtgtgt ctgtgtgtct gtgtctatgt ctgtgttatc tgtcttcctc 87660 

cctccttccc tctctctgtg tgtgtgtatg tgtgtgtgtg tgtctgtgta tctgtcttcc 87720 

tccctccttc cctctctctg tgtgtgtgag tgtgtgtctg tctgtctgtg tctgtgtctt 87780 

25 

cctccctcct tccctctttc tctgcgtgtg tgtctgtctg tctgtgtcta tgtgtcttct 87840 

ttctccttcc ctctcgctct ctctttctct ccttccccct tttcctttcc ctctctctct 87900 

30 ccttctctcc tcttctttta tctctccctc ccccaatgct tctcagcttt attctcttca 87960 

ctctactaac tggtgttctc tgctccccct gcacacaaca tacatgtgac ccctccaact 88020 

ctgcctagcc tcccatccca cagtagtcct tgggtccagt tccaaattcc caggagaaag 88080 

35 

aagttaccag agcatttggg gtcagttgtt cacccccaga tcaatcagct ttgccagggc 88140 

agggggcaag atggggaggt agcctgtgcc cactcaacaa actcaaggca tgcagattaa 88200 

40 aatgccaact gcagggggct aagcaaagcc accatccaga ctgtgaacaa aaggtctaca 88260 

agaagctaag tagcctcggg aaaccctagt cttgcaaatc caaatccgta aaaggaaaac 8832 0 

agtgtgttct tgccaagaaa ggcaagaagt ttattttgtc tggtaccatg ctaggtgatt 88380 

45 

cttattttaa tgaagatgaa actaaagata agaatgagat aagatgcatt attcattttg 88440 

tcagaagagg gaaatgcacc ttcttatgtg ttttgcagca gttgtacacc atgaccgaac 88500 

50 aggtgattgg agccaacagt gtacctggca ttattgcccc agataatgtt cacgtcatcc 88560 

agcccagcaa ccctgtggcc tctggaaatc atctgcagcc ttcagaggtg accacttacc 88620 

caatctcacc aaaagtaatc cactgtgata caggaagagc aaacttacag aatctactcg 88680 

55 

tggtgaacca gaactcagca gcaggtgtac agagtcaacc cattgggtat cagcgacagt 88740 

atccagtggg aacagccagt ttgcagacgg tgcctggagt gatccaatac acacagggaa 88800 

60 ccacgaatct ccagacatgg cctggagacc ttcagaatcc tctgaatgct aaccctgggc 88860 
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tgacacacac ctcaaactca tcccagtgga 
atcccaagaa attcataaat gaggaggtca 
5 ctctccgatt tgggtcactt cataagccct 

gtgttgagag ggtaaagggg aggtggagtg 
ccagtacaat cacccagatg ctggcccctc 

10 

acccatggtt tgggggataa aggtttgaat 
atctatcagg ggcatgggtc agcaatagtt 
15 ggcaatccag gtctcaggcc agctagattt 
gctctgggcc tccatttcct catctttaga 
ccagctccaa ggttttgtga tttccttttc 

20 

tcctccagtt tcatttagca ccttccaaag 
ccagatgcct ggctggatct ggatgcctgg 
25 cagatgccca tactctatgc actcatccat 
gagactcaga gataaccaaa acatcatcca 
agctgtcata attacaatac cttctggcaa 

30 

ctgcccaaag tgcaaattgt ctggcttctt 
ggaagacaag gacaggtgca ggaaggggac 
35 aagtagattg tgccaggaac aaggcgttgt 

tgaactaaat tctataaaac agcaatcagg 
ggctcatgcc tgtaatccca gcactttggg 

40 

aaatcgagac catcctggct aacatggtga 
tagccaggtg tggtggcggg tgcctgtagt 
45 atggcgtgaa cccgggaggc agagcttgca 

cctgggctac agagctagac tcagtcttta 
tcacaagcat caagtgagat tataaatgtg 

50 

ctatgatatt gcgtagtaat tatcatcttg 
atactgggta aaggtttgtt aactttggct 
55 ttcaaaattt gtcaaaacag aattaagtca 
aaatgaaaag ccaaaaagcc aaaactgcag 
gggaaggaaa ctatgagtgc ctcttagccc 

60 

gcagcacttg cagctgcagt atttagcccg 
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acacgtcatt 


tgcatcattt 


acttcattta 


88920 


gaacattagg 


ggtgagtgtg 


atttctcctt 


88980 


cgtctagtgc 


tctgggatga 


ggaagggaat 


89040 


ggtgtggggg 


gcagtttctg 


gaggctggac 


89100 


tcccttgtct 


tccttccact 


ccccaaggag 


89160 


aatcagtgaa 


catccccacc 


ttacccagct 


89220 


ccactaaaaa 


gaccactgag 


cagggagtca 


89280 


gctgagtggc 


cttgcagaca 


tctcttcctc 


89340 


ataagaagat 


tcagctttct 


taaattcctt 


89400 


aatcaagggg 


gcctctgcct 


ctttcctgcc 


89460 


ctcaattcaa 


ttctgttttt 


gaggcagaag 


89520 


ccaactctta 


gtatggagac 


tggcgtgtca 


89580 


tcaacactgc 


acgtgccctg 


caaggagctg 


89640 


tgaagatgaa 


ggagggtctt 


tgttcatggc 


89700 


ttgccatgag 


acatgcataa 


gttattgtgg 


89760 


tgagaagtta 


caaactgcct 


cctggaggca 


89820 


ccattcctcc 


ttatttaatc 


ctgggtagac 


89880 


ttgcctccca 


aacaaatcaa 


aatcaaatcc 


89940 


aatagaaatc 


ctggtcaggc 


cggacacggt 


90000 


aggccaaggc 


aggcagatta 


caaggtcagg 


90060 


aaccctgtct 


ctaccaaaaa 


tacaaaaaat 


90120 


cccagctact 


ctggaggctg 


aggcaggaga 


90180 


gtgagccgag 


acagcgccac 


tgcactccag 


90240 


aaaaaaaaga 


aaaaaagagg 


gaaatcctgg 


90300 


tgcggaaatc 


ctccaaaagg 


taaagctggt 


90360 


tattacaatg 


actatcatta 


ttattaatta 


90420 


agcaattatt 


ctaagatatt 


ttggaaggat 


90480 


gagcaaatat 


ctggggaacc 


tggccaaaca 


90540 


actatcttta 


atcaattaca 


attttgtttg 


90600 


ttttcccgat 


tgccccaaga 


atattcgcca 


90660 


agataacttt 


gccaatgtct 


cccttttatt 


90720 
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ataactttcg ccttgctcta gcatattgac 
agcattctgt ttttagtagt agtattccca 

5 

aaaatgtcaa atcctagaaa acgtagcatt 
cagttgttgg ccaaagattc atttgatgaa 
10 gttagttctg tttagaaatc actcgaaaaa 

atcagccaga tttgcttcag cctcaagagt 
agcgagcggc actatttttt ttctaaatgg 

15 

gggcctgtag acagagtcaa gcatgtcgcc 
tctgtcacca actcatagac tcatggacca 
20 tcatttgatc cctgaggaaa ctgaggccca 
ccaaatgaga ggggattcat cctggacggg 
agggaaggat aggatttcag gaggaggaga 

25 

cagtggagga aaagcaggga gtaaggggaa 
tccccttgtg tacgttgcca aggggaactg 
30 aatatcggta ctgggaagat gaatggagtc 
caggcctggc gtggtggctc acccctgtaa 
gttcacgagg tcaggagatc gagaccatcc 

35 

aaaatacaaa aaaaaattag cctggctggg 
gaggctgagg caggagaatc acgtgaacct 
40 acgccattgc actccagcct gggcgacagg 
aaaaaaagaa acacctactg catgctagga 
atctctacca cagccctgca agctggatac 

45 

ctcaaacatg ctcttgaggc tgaagcaaat 
taaagacctt tctttgagtt atttctaaac 
50 gaatcatcta ttttggaaaa atcagattca 

gaacgatgtt aacatcacgt gtaggaatgc 
tccctgcaaa taaggaaacc ctgaggcaga 

55 

ccagtgctgc agccaggaca ggagccaggt 
tgtctctgca cacttgtggg ctcagtctaa 
60 tgtcttacct ttcctttatg cctgtgattc 
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cttggaagca 


agagacatca 


ttctatttat 


90780 


tttacaaaat 


acagtaattc 


tcgattacta 


90840 


tctacgcatg 


gtgttaacat 


tgttctcaaa 


90900 


tccagtgttt 


ccgaagtaaa 


caattctaat 


90960 


ccgtttttat 


catttatttt 


cacaatgaaa 


91020 


gtgtttatgt 


aaaattaaat 


gagtgctggc 


91080 


gaacagggtt 


cattaatata 


ctgataaagt 


91140 


ttaggaaggg 


atgttcatgt 


gaaacccagg 


91200 


ggaagacaga 


aagcagctgc 


cctaggctag 


91260 


gagaagccag 


tggtaaatgc 


tgtaggagcc 


91320 


ggctcaggaa 


aagaatctct 


ctggtcctta 


91380 


gtaaagcaag 


cagatttcag 


acagtggcaa 


91440 


aaacacagga 


cttgcagggg 


acagaggatc 


91500 


tggggacgtc 


atagaggagg 


ccacaaagag 


91560 


agcagagggc 


attgaaatac 


ttactgcatt 


91620 


tcccagcact 


ttaagaggcc 


gaggcgggcg 


91680 


tggctaacat 


ggtaaacccc 


atctctacta 


91740 


tggcaggcac 


atgtattccc 


agctactcgg 


91800 


gggaggcaca 


gcttgcagtg 


agccaagatc 


91860 


gcaagactcc 


gtctcaaaaa 


aaaaaaaaaa 


91920 


actgggctac 


aagtttcata 


ggcactttta 


91980 


tgttaccccc 


atcttgcaaa 


tatgtagcct 


92040 


ctggctgatt 


ttcagagtga 


aaataaaata 


92100 


agaactaaca 


tcgaatggtc 


tgaatcatca 


92160 


tcacatgaat 


ttcggccgac 


aactatctga 


92220 


tacgttttct 


aggatttgac 


atttttggtt 


92280 


gaggatattt 


gctggagacc 


acacagccgg 


92340 


ggttttcagc 


cacaacccgg 


gctcggcccg 


92400 


agggaacaca 


tttcttgctc 


tgttataggc 


92460 


tcagcctgag 


tgcaaattcg 


catcacctgt 


92520 
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10 



gaagttttta 


aaaaaacacc 


caggtgtgga 


tctgggatga 


gcccccaaca 


ccaatcaccc 


caaagcacag 


ctccacagcc 


cacaggcccg 


ccctgcaggc 


catccagatc 


ctcatcggcc 


tgctgtatta 


ctatcctttt 


gtgacctggt 


ccgtgagtac 


aaggccatat 


ggtctccttc 


uggaggaaga 


aggagcaugg 


daCCCCdLLC 


gcctccaact 


tgcagataca 


gcaaagggtt 


tcccaccacc 


accctcccca 


tctcccatac 



tacaaacact cattatcatg ttctttacct 

20 

gatatttatc tgataaggcc aggcagggtg 
ggccgaggta ggtgaatcac ttggggtcag 
25 agacctcccc ccacccgcca tctctacaaa 
tgcacttgta gtcccagcta cttgggatgc 
gtggaggctt cagtgagcca tgatcacacc 

30 

agcctgtctc aaaggaaaaa aaaagaaata 
gtgtgctata gacagcatct acctaccata 
35 aggaattatg taaaatgatt gtcaaaccat 
tacatacgac atggaaatca acacgtgctc 
acctcataag gcagatgtta atcatttacc 

40 

gagcttaaat aatattaata aattgaacac 
cctagggtgg tcaaattcat aaagatagaa 
45 gtaagagggt aatggggagt tagtgtttta 

aaggagatct tggggtggct ggttgcacaa 
tttttttttt ttttgagggg gagtctcgct 

50 

atctcagctc actgtaacct ctgcctcctg 
cgagtagctg ggattacagg catgtgccac 
55 agagacgggg ttttgccatg ttggccagcc 

gccccccttg gcttcctaaa gcgctgggat 
caatatgaat tttcttggtg ctagtaaacc 

60 

ttaagttgtg tgtattttac cacgattaaa 
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ccctaccctc agagatttgt atccatttgg 92580 

ctggggtgat tctggggcac agccagggtg 92 64 0 

cagttctgct ctctcacagt gacttgttct 92700 

tgacgcacat tttctctgca attaaccctg 92 76 0 

tgtcagggta cccgctctgg ggaggattat 92820 

ctgggtcacc agactgggac aagaaggaag 92 88 0 

ccagtaatat gactctgtag actagagcca 92940 

atatcaactc cctgccccag caacactccc 93 000 

cccccaccca gcctctaggg aactattggg 93 06 0 

cttccaccct ctctgggttc tcttttatga 93120 

gctcacgcag gtaatcccag caatttggga 93180 

gagtttgaga ccagcctggg caacatggca 93240 

aatacaaaaa attagctggg tgtggtggtg 93300 

tgaggtggaa ggatcacctg agcccaggag 93360 

actgcactcc agcctgggta acagagtgag 9342 0 

tctcatatgt aactcagagt acgtaaagtt 93480 

ccagttatgg gagccgactg caaaatttcc 93540 

ttggtagcag gaagttggcc atgttgggta 93600 

caaatcaggg cccttccctg cagtgctgcc 93660 

agcaggccat tcagtatatt tcatcttgca 9372 0 

ccacatgtac aattccactt agatgaggtc 93 78 0 

aaagcagcat ggtgattgcc aggggctggg 93 84 0 

tggggactga gtttcagttt gggatgataa 93900 

caatatgaat tttcttttct tttcttttct 93960 

ctgttgccca ggctggagtg cagtagtgcc 9402 0 

ggttcaagaa attctcctgt gtcggcctcc 94080 

catgcccggc taattttttg tatttttagt 94140 

tggtctcgaa ctcctgacct caggtgatct 94200 

tacaggcatg agccatccca cccagcctaa 94260 

aaacacttga aggggtttga atggtatgtt 94320 

aatagtaaat taaattaaag ttttaaaaaa 94380 
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tgaacaccta 


cgtgtccacc 


acctaactta 


aaaaacaaag 


catcacacgt 


ggtttttaaa 


94440 


tttttaaatc 


cctataagta 


tcccctctat 


gccccgagag 


gtaaccacta 


ttttcaattt 


94500 


aatgtttttc 


tttttttttt 


ttttttgaga 


cggagtctcg 


ctctgtcgcc 


caggctggag 


94560 


tgcagtggcg 


ccatctcggc 


tcactgcaag 


ctccgcctcc 


cgggttcacg 


ccattctcct 


94620 


gcctcagcct 


ccagagtagc 


tgggactaca 


ggcgcctgcc 


accgtgcccg 


gctaattttt 


94680 


tttttttttt 


tttttttgga 


tttttagtag 


agacagggtt 


tcaccggggg 


ctcgatctcc 


94740 


tgaccttgtg 


atccacccgc 


ctcggcctcc 


caaagtgctg 


ggattacaag 


cgtgagccac 


94800 


ggggggtggc 


cgatttaatg 


gttttcattc 


ccttgctttt 


tttttttttt 


ttttttttga 


94860 


aaaggggtca 


ctttgtcacc 


caagctgaaa 


cacaggggca 


caattttggg 


tcactttaac 


94920 


ctctgcctcc 


taaggtcaag 


cgaatggccc 


gccccaaccc 


cctgaataac 


tgggattaca 


94980 


ggggggcccc 


acccctccca 


actaattttt 


gttttttcaa 


taaaaacagg 


ggttcaccat 


95040 


gttaaccagg 


gtgggctaaa 


actcctgggc 


tcataatcca 


cccacctcgg 


cctcccaaag 


95100 


tgctgggat 












95109 



<210> 81 

<211> 19303 

<212> DNA 

<213> homo sapiens 

<220> 

<221> genomic_JDNA 

<222> (1) . . (19303) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> coding_region 

<222> (8830) . . (9012) 

<220> 

< 2 2 1 > coding_region 

<222> (9468) . . (9587) 
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<220> 

<2 2 1 > coding_region 

<222> (10759) . . (10815) 

<220> 

<221> coding_region 

<222> (12466) . . (12597) 

<220> 

<221> coding_region 

<222> (14049) . . (14159) 

<220> 

<221> coding_region 

<222> (16890) . . (17009) 



<400> 81 

atttttctaa agtacaaagt catcatgtta ctgttctgcc taaacttatt caggctctcc 6 0 

tttgcccttg ggctaaagtc caaatgctct agcaacactt acaaagccct ggagatgtag 12 0 

cagttcacct ctccagcccc atccctcctc accctccact tacacttgac cccgagccat 180 

cctgagtctc cgtgagctcc atctgcctgg aaggctgctc tctgcctgct tccaaccctc 240 

ctatttgtcc tgcttragctc ccaaccccac ttcagacctt catttagata ccacttcctc 300 

caggaagcct tatctggcct catcccccaa ccccttctag attaggggaa ctcctccagg 360 

ctctcgccac acccacatct cagtatcaca tcactcaccc gtccacagca cagttacctg 420 

tctccctgtt catcttattc aatggacttc atattctttc cctattgctg ctgtaacaaa 480 

ttaccatgaa cttagtggcc taacataaca cagacttatt attagacagt tctgaggtca 540 

gaggtcctaa aatccaggtg tcggcagggc tgagttcctt ctggaggctc cagggaggcc 600 

ccgcttcctt gccttttcca gcttctagag gccacctgcc ttgcttggct tgggaccttg 660 

gcctccacct gccaagctct tccactcact ctgaccctct tgcctcccgc ttgggaatac 720 

attgggccca ctgaataatc tatccctatc aacaactttg acttaatcat ctgcaaagcc 780 
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cttttcactg 


cataaggtaa 


cgtattcaca 


qattctaggg 


atgtggacat 


tgttgaggtg 


840 




ccattgctct 


gactaccaca 


aattctgagc 


tcctcaaggc 


caggggcagc 


ctcttgccca 


900 


5 


ctcc tggatc 


cccagcatgt 


cacacagcgc 


ctggcccaag 


taagtgctca 


ttagctgttt 


960 




tatcaatgaa 


cggatcagtt 


aatgaacaag 


ggtatcaggg 


gggtgggggg 


ctgtcttcct 


1020 


10 


actcagcaaa 


aaagcactgg 


tcctggaaaa 


acactcaccc 


catgtttgag 


gagctaaagt 


1080 


catcracrtaat 

^3 *-3 3 3 »-3 


qaacqqqtqc 


aaggcaacca 


atgacagtga 


ttattggcct 


cacgtgtctc 


1140 




agcattacag 


tqqqcaqqca 


agtgccttca 


gaagaaacag 


gtctctctca 


aagctactgg 


1200 


15 


ctgc tgctca 


qcctqqcccc 


ctacccctca 


qqqqaaqcct 

ZJ Zj ZD _J -3 


gcatagtcca 


gcaggtctga 


1260 




qaqqqqtqaq 

3 3 3 3 3 ZJ ZJ 


tggcct tttg 


cctgattgtg 


tgtgtgacat 


cgaagtctat 


gccacattga 


1320 


20 


acatgcatga 


cagctgagca 


cgagcgaccc 


cggaagcaca 


cagcaactaa 


aataattagg 


1380 


cacggccact 


gacacgtgtg 


aqqcqctgct 

Zj ZJ Zj *~ Zj 


tttaagcatt 


ttacatattc 


taactcattt 


1440 




aaccct ccag 


caacttgatg 


ggtactatta 


tccccatttt 


atagagggga 


aaactgaggc 


1500 


25 


ccaccaaagt 


caaataactt 


gcccaaggtc 


acacaacatt 


aagaggtaga 


atcaagattt 


1560 




gaactcaggc 


atcccagatg 


cagagtcagc 


tctgaagtgc 


tatgctgtgc 


tcagcaagtc 


1620 


30 


aaagtcagtc 


agtcctggca 


tcagggacta 


tctgtacagc 


cgctggctct 


gcctcgcgct 


1680 


tccctgtcca 


ccctcagcag 


agcgatgtgg 


agagagaagc 


cagatggaaa 


aggaagagcc 


1740 




caaQQcqaqa 


tctaqqqqtC 

^ ZJ ZJ ZJ ZJ 


gctggatctg 


ggggtcgctg 


gatctggggg 


tcacaggatc 


1800 


35 


tqaqqattQC 


tqqatCtqqq 


aqttqtqqga 

—J -J ZJ ZJ ZJ 


tctgagggtt 


gttggatctg 


ggggttgtgg 


1860 




qatctqqqqq 

3 ^ Z? ZJ ZJ ZJ ZJ 


ttgctggatc 


tgggagttgt 

ZJ ZJ ZJ ZJ _J 


gggatctggg 


ggttgctgga 


tctgggagtt 


1920 


40 


ataqqatctq 


qaQQttqctq 

3 3 3 *- w 3 w w 3 


qatctgqqgt 

zj ZjZjZjZj^ 


ttgttggatc 


tgggggtcgt 


gggatctggg 


1980 


Qqttqctqaa 

3 3 , - , -3 v '»-'33 1 -* 


tctqqqgqtc 


actgqatcgq 

33 33 


gggggtcact 


ggatctgggg 


tcataggatc 


2040 




tqqqaqtcac 


aqqatctqqq 

ZJ ZJ ZJ ZJ ZJ 


ggtcacggga 


tctgggggtt 


gtgggatctg 


ggggtcatgg 


2100 


45 


aatCtqqqcfQ 

ZJ ^ ZJZJZJZJZJ 


tcacaggatc 


tqaqqgttqt 


gggatctggg 

ZJ ZJ ZJ — ) ~J —J 


ggtccctgga 


tctgggggtt 


2160 




qtqqtacctq 

3 33 3 


ggqqtcgctg 

ZJZ3Z3ZJ ZJ zw 


gatctggggt 


cactggatct 


gggggtcatg 


ggatctgggg 


2220 


50 


qtcactqqaq 


cccactccac 


acctcctcca 


ccagaacagc 


ctcacggaag 


accttcacac 


2280 


ttcccagagc 


agcaccgtgc 


tcatggaaag 


atcagggcac 


aagatgtggc 


aactgattct 


2340 




gtcagatgca 


atttgtgggc 


catgcctgga 


tcctggatcc 


tgatttgaac 


aaatcaaaag 


2400 


55 


taacaagaca 


tttgtgagat 


aatcagaaac 


tgaagctccg 


actagttatt 


tgatacccgg 


2460 




taagtaccaa 


aaactagata 


agaacaggtt 


tttctgactc 


ctagggttaa 


gcttgtcccc 


2520 


60 


cacaacaggg 


gtccccaacc 


cccaggccat 


ggagctatac 


cagtctgtgg 


cctgttagga 


2580 


accaggcctc 


acagcagaag 


gtgaggggca 


acaggcaagc 


atgacctacc 


acctgagtgc 


2640 
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cgcctcctgt cagatcagcc gcagcaatag attctcaaag cagaacgggc cctattatga 2700 

attgcacatg caagggatct aagtggtacc ctcatgagaa tcgaatgcct gatgatctga 2760 

5 

ggtggactaa tttcatccca gaaccatccc ccatccccac cccggcccat ggaaaaattg 2 82 0 

tcttccacga aactggtccc cagtgccaga acggttgggg accactgctc tatgaaagta 2 88 0 

10 gtttcttttt tcctttcctc tttttttttt tttttttttt ttgagacagg gtcacactgt 2940 

gttgcccgga ctgaggtgca gtggtgtgat cacagctcac tggactcaag tgatgctccc 3 00 0 

acctcagcct ctcgagcagc tggaaccata gacagggacc aagacacctg gctcattttt 3 060 

15 

gaatttttga atttttgtag agatgaggtc tcgctatgtt gcccaggctg gtctcaagct 312 0 

cttggctcaa gtgatcctcc cactttggtc tcccaaagag ctgggattgc aggcatgagc 3180 

20 caccgcccgg ccctttaaaa gactagtttc caaatgttca acagagctct gtggggctgc 3240 

agggttaaac tctgagcgta aatagacctc agtgagcacc aggactcggt tttaattttt 3300 

ttgttctgat catgtggtca cacttttttt ttaaagagtc catatctttt agagataaat 3360 

25 

actaaaatgc ttacaaataa aatgggaaga tatctgagtt ttgcttactt ctgggggaga 342 0 

ataagattgg ccaagagctg ctgatggtgg ctgtagctcg cgaatgggca caggggtgtc 34 8 0 

30 atggaaccat tctctctact tttgtgagtg cttgaaattt tccagaatta aaagaactct 3540 

ttcaaaaggt tcaacttctt actccactcc caggatctaa gacgattaca accactttca 3600 

ctccatcagt catgccctga aaacagatta aattatcaac atttcccttg ccaaaggaaa 3 660 

35 

tttacaactg tcaaggagag acataaagaa gggcactaag attttcacct gccggggttt 3 72 0 

99cgggaggg ggaaggactt tgacagcttc catgcaaccg gggcatgaag cagggcgtgg 3780 

40 gtccaggagg gagaagatcc aaagtcgaat gttgtaaaga tgtgctgatt tcccagcgct 3 84 0 

gtctctgggg agctccggca gcgcaagagg gcaaagcaca gctggaagct cagagctgca 3 900 

gtcccaggtc ctggggtgag tggtcccccc atggcaggaa cagaggggag tgattttgtg 3 960 

45 

tttccatgtg gagttggggg gttgctcagg tgtacagacc tggtcccccc ttgaagccag 4 02 0 

gggaccggcc tcttataccc ctagccaatc agttgttggc tgtagggttg gagcggtgtt 4 080 

50 acttaaactc ccagaaggga ggagatgtga gcctttagca atgaacaatt ataccagctg 414 0 

gagaatgggt ggcccaccca gcagagggga tctgggaagt agggggcacc gtcggcctct 4200 

cccctgcgga gccccatcct gctgccagtt actgaccctt attgctggag gagccccaga 4260 

55 

cacttttctt gcagtctgga aatgcataca cacagagaga ccagttcaag aaaacagagc 432 0 

cccagggagg agaagggact tgtcgagggc cacgaggggg caggactaga atccagattc 4380 

60 ttgactccct ccttggacgg aggcctctgt ttcctgcagg ccttccagct gctggtgggt 444 0 
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gagctctctt tacaccaggc acactgtctg 
gtcgtaggtg gttttctaac tggcattcac 
5 cccaacacca gcctgcctgt cacatgttgt 

cctgcctttc tgtccttgac atgcatgcac 
acgcacacac acacacacac acacaccctg 

10 

tgcacagtct gatgagggtc atattattca 
gacctcccat ggacacatct tactcactcc 
15 aactccacaa acaaatccac gaatgacagt 

tcagtgcctt tcttccctca cgacccccta 
gccctctccc cctgtaagca ggatggggat 

20 

agggacaggg tgtatcatgt cttaaccaag 
ctcacagccc tctccccctg taagcaggat 
25 gatgataggg atggggtgta ccatgtctta 
ctcactggtg ggattctaga cacgcaacag 
ctatacgcca gtccctgtca gttgcaggca 

30 

ctagggacag gcagtggcac tccccatgag 
taaaaaaccg gcccaaggtc aaagctccgg 
35 ccattctagc tgacaacttc cagcgctcag 

ggttctggag ggtcctgccc agctctgtcc 
aacaacgaga atatgtgtgg gacccaggca 

40 

ctagtggcag tgggtttccc gtgggatacc 
ctgagggggc tggggttggg cggggaaagg 
45 ctgctgcctg ccccctctct gtcagcacac 

tggaagtggg ccaggacagc tgggaataag 
gtcttggaaa ggcgtacgag agagtcataa 

50 

ccctctgcaa ctccttcctc tcctggttgt 
gtgagagaag tattatattt aaagttggaa 
55 ttcccactgt gatatgcacg tgagcaaacc 

ccaggaccag aaattgaaac cttcaaggcc 
ccaattgcat agaaccgtta gacaatgaga 

60 

gtggctcaca tctgtaatcc tagcactttc 
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gaagcaccag gaggaggctg gttcagtaca 4500 

tgaaacgctg atcctctaag gcaccccacc 4560 

gacactcttg tcactcagca cctctcaatt 4620 

atgcatgcac acacatgcac acacacacac 4 680 

agaagctctc tctgtcctct agggctccac 4 74 0 

tgccacgttt cctctgcctg ctcagccaca 4 80 0 

tgcctcggtc cctgcagtgt ccagtgcaga 4860 

gttggatctc agatcactag ctcaatacac 4920 

cccccacccc agggccccag agctctcacg 4980 

ggaggcaata aactcaccac tccagatgat 5 04 0 

aaccagacga ggacaaggtg gagcctcgct 510 0 

gcagatggag gcaataaact caccactcca 516 0 

accgagaacc agacgaggac aaggtggagc 522 0 

gccagtggca actgtggctg acatttattg 5280 

ttgtttcagc tactcctcac cacctacccc 5 34 0 

aaggtgaggg gagactcaga tgctgagatg 5400 

ggagcagagc caggactttg cacggagcac 5460 

ggccgtgccc actctgctgc tttgctgggg 552 0 

ctgtcctccc tccactccca gaactgaagg 5580 

tcccgtggcc ctgtgcagag cgctgaggcc 5640 

aagctggctg ggtccgtggt cctgagctgg 570 0 

ctaaaaaagt tcctttatgc accaaacagc 5 76 0 

tgtcacctcc tagcagccct gcacaggtcc 5820 

tgttctaagg aatcagagtt catgctcttt 588 0 

caaacttttc cccatgatga ctaaggtctg 5940 

ataaaattcc ttgtagcggc tctctatccc 6 00 0 

gggaccttag agatcatcac tcaattccca 6060 

aaaagcagag aaaaggccaa aaaccagagc 612 0 

actttgccca aaagcagacc ccacagaata 618 0 

aaaaaaagat cctatggcca gccgggcgca 624 0 

ggaggccaag gtaggcggat cacctgaggt 63 0 0 
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cgggagtttg 


agaccagcct 


gaccaacatg 


aagaaacccc 


atctctacta 


aaaatacaaa 


6360 


5 


attagcaggg 


cgtggtggca 


ggtgcctgta 


atcccagtta 


ctcaggaggc 


tgaggcagga 


6420 


gaatcacttg 


aacccgggag 


gcggaggttg 


cggtgagcca 


agattacgcc 


actacactcc 


6480 




agcctgggtg 


acaagagcga 


aactccgtct 


caaaaataga 


aataaataaa 


taaataaata 


6540 


10 


aataaataat 


aaagattata 


tggccaatta 


agtttgggaa 


atgcccttaa 


tactaaaaca 


6600 




gagtgtgcca 


cttgtagaaa 


atctcagagt 


ctttaatatg 


agaaaaaggg 


agaagaggct 


6660 


15 


aaccatggtt 


tctcaaactt 


atttgaccat 


ggaacccttt 


tactgcagat 


acatcaggac 


6720 


caatgttcca 


tggaacatac 


tttgagaaag 


gttattctgg 


attcttgagt 


ccaactccct 


6780 




gttggatatt 


tgaaccattc 


ctactgccac 


agctctcaca 


agtgaatgtc 


taggtggagg 


6840 


20 


tacagctcca 


gtgcccaggc 


actcactatc 


tccgtggaca 


agcagttcca 


tccaaggcag 


6900 




ctctactaag 


aagtccctct 


tcttctgttc 


tttgcagtgg 


agtccaggcc 


tgggtccagg 


6960 


25 


tattgtcctc 


ccaggtgtgc 


agagtctagg 


tgaagccagt 


ggctaacctg 


gttcaggcct 


7020 


ctgccaccga 


atacatccct 


tagtccagct 


agctgtgtca 


gacggacttc 


tgaaggcttg 


7080 




ggaaccttcc 


caagggacag 


ttcccagagg 


gacagttccc 


atgcatagct 


gggcagagtc 


7140 


30 


acaaaaagaa 


tcatacagca 


ctgtgtctat 


gagctcagct 


tctgaagccc 


aggaaagagg 


7200 




gagaccctct 


gcccactcac 


cccactaaat 


cagcggtcgt 


cactgcagga 


ccatggagat 


7260 


35 


atcaatcagg 


gggtcctacc 


cagcagccca 


ggcccagagc 


actggcgatg 


gagtaagacc 


7320 


atgcgtggtt 


ccaattccag 


gcacttcctc 


actgtgtgac 


catggccaag 


gcaggcaacc 


7380 




tctctgaacc 


ttggtttcct 


cttctatgca 


atggtgctga 


cttcctgagt 


attaagggag 


7440 


40 


qttqccaqqc 


aaaactgcct 


cctggcacag 


tgcccggcat 


acagtgctca 


cccaactgtg 


7500 




aggccccatc 


tggctctttg 


gctcttgata 


atgtcccttc 


ctttcccctc 


tacccaatcc 


7560 




aggatatagt 


cacctgggct 


atttgtccct 


ccaaacttcc 


caaaactctt 


tccctgcatc 


7620 


45 


















cctgtggaaa 


atgtggatgc 


tcaatggaac 


ttcctggatc 


cacaagtcag 


acttaaaagt 


7680 




gagaggaatg 


gccctccctt 


ttcattcatc 


caggatacag 


tctgagagca 


gcctctctct 


7740 


50 


ctcaaggatt 


tgctcattag 


ccccatcccc 


atggagaaac 


catctctgac 


ctcaatccct 


7800 




gcaagccagc 


tccctttctt 


ccccgcccag 


ctgtccactt 


cctgggacag 


aaagactacg 


7860 


55 


ttggattttc 


tcaccaagtc 


ccaggcagcc 


cccaagccct 


agtgcatccc 


acggcgtcag 


7920 


tatacactat 


accaggccct 


gtctgcaggt 


cgtgtgtcca 


gcccccttct 


aacttttttc 


7980 




cccaactcct 


ttttgtgccc 


atgagaacat 


ccagaacatt 


catttccaat 


tgttgagcca 


8040 


60 


acagtgcatg 


aaggacttaa 


gccccaggct 


ctggagccag 


cctgccatgc 


ttcaagtcca 


8100 
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agtcacttac tggttgtgtg accatgggca 
aaaatgagag tgaggacggt atttacctca 
5 tgcacgtaga aggtttatct cagtgcctgg 

cgttaaatat ggacttcctg actcagtgtt 
tttcttccct tctctccagg aggcatgggc 

10 

cacctgcctt cagtcactct tcttggacca 
attgacctgc acctctcgca cctcccaact 
15 gagctctgtc ttccactggt tgtgaagttt 
agctgtgtct ttgcaacccc agaaaggctg 
cagattattc caggtcaaat caagaccaag 

20 

ttctggaaag atctggtccc ttaaacagag 
cctcagaaat gagacgggtt ctgacggcct 
25 ccagcatcaa tgaaagcaga agccacagtt 
tcatggcaag tcctcagccc agtccagccc 
cagcccaagc tcctggctcc acaccagcac 

30 

aaggagctgg gggtgagcat ccacttccca 
gggatatgag gaggatgctg gggggcccca 
35 gctttcctgc tgaacattca tctgaactgg 

ccacacctac tactgccttg tgaggtttgg 
aggctgccag ctccaggata gatagctccc 

40 

ttctgcagga aatcctgggg agtggtcctt 
gaggacagaa gggactcacc acagccagta 
45 ccagtgaccg atgccctcac cccagccatt 

ccatcgctct gctgcacctg gtctttgggg 
acctggtggt gctgaagtct tggtatccat 

50 

aaaacacaga ccgggtgtgg gagtctgcaa 
caggatgcag ggtgggatag agagagctaa 
55 ccagtgttcc tagctctgct tcttttatct 

agcccctgtc aattctgagt cccaagagag 
gaatatgtgc atttggcaaa gaggagggtg 

60 

ccagccttct gagcagccag gctgcctctg 
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agtggcttaa 


cctttctgag 


cctcatctgc 


8160 


tagggttgga 


atgaggacaa 


aatcaggtaa 


8220 


agcagagaaa 


gtatttgtaa 


atgttagcat 


8280 


ggctcataaa 


aagtcctcgg 


tggaagatat 


8340 


accaccacag 


gaggtaggac 


aattatttcc 


8400 


gtggatggag 


actgtttaca 


aagcccctcc 


8460 


cttccaaact 


gcagtctggc 


cactttgggt 


8520 


gcctggtcct 


catttttctg 


ccccaaaagg 


8580 


tgagcagcaa 


ttactcccat 


ctccaggggc 


8640 


accaggggct 


gccagacagg 


gcaaccacag 


8700 


gtgattggca 


gaggctcacc 


caggcacaag 


8760 


tcctccccgt 


cctgcagcca 


gggcccccat 


8820 


attcccagcc 


gttgtgctag 


ggggctccca 


8880 


tggcagacaa 


gtgcacccca 


gaacacgacc 


8940 


gagaagtccc 


agaagaagag 


cagccttctt 


9000 


gggtcccagc 


agtgggaggc 


agaggagagg 


9060 


agggaagaaa 


gggaaaagag 


aaaggctcca 


9120 


aaaaggcctt 


gaagattgga 


gagaatctgc 


9180 


acagggccct 


gtcctgggga 


gaatggcaga 


9240 


acccatgggg 


cccttggcag 


cttcagctct 


9300 


cttagagagc 


ctggtctctg 


ggatccctaa 


9360 


ggtccagggc 


actagtggcc 


tcaattagag 


9420 


ctctcttccc 


cacccaggcc 


ttccacatca 


9480 


gctacctggc 


ctctatagtc 


aagaaccttc 


9540 


tctggggggc 


tgcctctgtg 


agtagaaggc 


9600 


ggcaggcctg 


ggaggccctg 


gcatttggga 


9660 


taggagtgcg 


gcagaagcga 


ggccctttct 


9720 


tagtcacaat 


ggaaatagct 


ccttccaggg 


9780 


cacttgqagg 


ctatccattc 


tcccataatt 


9840 


agtaagaccc 


agggacagct 


tcattcttcc 


9900 


gatccagcta 


caggtgccct 


gttggggttg 


9960 
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agctacaaaa acacacaggg tgtaaggtgt 
gatgggagat gacactggac cagtgcatgc 

5 

ggtctccagg gagcccaccc cattcctgac 
tccaagctcc acggcactgt cttgggcaag 
10 tgttctcact gcccacagga tggggctttc 

cagtgcttat tcaccctgtg ctcccaggga 
ggaaagaatg attgcagagg tgctcccaga 

15 

aactgcaaga ttcaagaaaa gagaaacagt 
agcgatggtt tgggtttgta aaatgaggga 
20 gcagggcaca gcttggtcca aggtgtagag 
gggaagatgt gaggtggggc agcttcttgg 
ctttgctgga ggaatagtat tccctgcccc 

25 

aataggggtg ggggtgggaa tttgtgggtt 
ttcaaatctg tcctgcagtt tctcatttca 
30 tctaaaactt acctggtgag tgggcacact 
taactttgtt aagggcatct gtctgtaggg 
ctaagagcag gctcttgcct tcaggacagc 

35 

cagacaggct gacagacttg gctgtgggag 
ttccagtgca gtgaggcaga gaaggaaatg 
40 gacacaactt tgcttcctct caaatctact 

caagccccag cctccagctg gatttgcatc 
gaacggtggt cctccctttt ctgcccgtgc 

45 

cctgcacaca tggctcccat ctttgactta 
ttgtcctccc ggagatcttc cctgacctcc 
50 tggacttacc tgtctctact gtacactaaa 

gtgatagccc aaggtggccc aagtaggcac 
gtggaggtgg atggatgatg gaaggaacat 

55 

tcccacctcc cctccttctg cccccgaagc 
tcagcctggc taggctgtgg ttgctcccta 
60 tggttgtgca gtttacccag cagaggcagg 
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ggctcctgtc 


ctcccaaggc 


catcagccag 


10020 


accatttccc 


agaagctcta 


cattccttca 


10080 


tcccaggtgg 


agtcctgcct 


atgcactaag 


10140 


tatcccaatc 


cccctttcac 


atcagtccac 


10200 


actgcacagt 


gatagaactt 


tccacacatg 


10260 


ctagcccata 


gtaggtgctc 


agtgcacatt 


10320 


gaccttggaa 


gcaagaactc 


atatcctccc 


10380 


cggggtggag 


gggatagtgc 


cactgatccc 


10440 


agcaggaggg 


tctaggaggg 


agttctaagt 


10500 


atgggaggag 


gtatggaggt 


ttgtgcacat 


10560 


atgggccaca 


tggcaggcaa 


gaggagctca 


10620 


actggccgcc 


aagggttgat 


tttcttggag 


10680 


ttgaggaacc 


ccagcttctc 


tcctttgact 


10740 


gggatcttgg 


cgataacaat 


gaagaccttt 


10800 


gagatcattc 


tcttccatct 


gagcatggca 


10860 


gtgattgtga 


gggcacttct 


ccaggacttg 


10920 


agaggatgtc 


caagcagaag 


ggtccagatg 


10980 


gaaaggaagc 


accatctgaa 


gcttctctct 


11040 


gctcccaggc 


accctgaccc 


atccatctag 


11100 


gtgtcttaat 


cttaatcacc 


acatagcctg 


11160 


tacttctcct 


ctctaagagc 


caggcagatg 


11220 


tggccactgc 


ccaaacacct 


tgccctctcc 


11280 


ctcctgacca 


ctcaagacaa 


cctgtatgtc 


11340 


atggctgcct 


tccattctcc 


cacaactctg 


11400 


ttatttctgg 


gcagaatgtg 


cccgagcata 


11460 


agagtagaca 


cctagcacaa 


gactggatgg 


11520 


ccctgcaaat 


cacagctcac 


acccaggtct 


11580 


ccaagccagg 


tccttcccta 


tttgtgtccc 


11640 


ctttctcaga 


gggcatacag 


aatagaagga 


11700 


agatgcttga 


caaagacgtg 


cccattcccc 


11760 
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agacctcacc ctcctgggct gagccccttt 
tgagcataag gacactctta ctgtaaagaa 
5 catatgtgca tagtctttcc tcactaggag 

gatgcttaac ccgagaggac gatgaataca 
agaaaatggg ctggtactgg gacatgtgag 

10 

tttctggtgg ggaaggggct ggatgagggg 
catagctgat gtaggatggt cggggctggc 
15 ttattgcaaa gctcttggat ctgaaggcag 

aggcaagggc cagctggacc ctcctcagtg 
tgtgaggggc tgaggggcca gtgacaggca 

20 

gtctctcctg aggctacaga ggaagctact 
gtctccaccc tttggacagc tgttaccttt 
25 atgacaaacc tcatcagcct cttttgcgtg 
ctctttctgg agagcccatt tgagtccccg 
agtacccagg cctggagggc cctctgactt 

30 

caggaggttt ccagctcaca gaaagcccct 
tttgccaacc tgtttctgtc ccctacaagg 
35 actggtcagc atcactgaat ccaagtccac 
ggggaggagc ctgcctgagg tcaagcagca 
aaactgctga tttcccagcc agggctctaa 

40 

gggagcctcc catcagcctt gcaaagctgg 
ggagtgaaca cggaggtcca gagaggctag 
45 ctcttttcac catatcactg ctgcctctga 

gccaggctcc aagaattctt ttttcttttt 
ctcttgtcac ccaggctgga gtgcagtggc 

50 

ccagattcaa gcaattctcc tgcctcagcc 
cacagccacc accacgccag gctaattttt 
55 gtcggccagg caggtctcga actcctgacc 

agtgctggga ttacaggcgt gagccaccat 
ttattcctta tctcctccca gcacatacac 

60 

tgtgcgagca tgcatggaca cacatgtgcg 
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tagaaccaca 


ggcctcttgc 


tttgtatacc 


11820 


acccattcca 


gtataaaagg 


ctacagattc 


11880 


gatttgcact 


tggcaagagg 


ctcttggaat 


11940 


ggcatttcag 


gaaccatggc 


aggtgggcgg 


12000 


gatagggcac 


tttcagagct 


cctcctgacc 


12060 


tcacctgggt 


gtcgaatccc 


tgacacttgg 


12120 


tttgagcagg 


aaatggcaat 


agggcatcct 


12180 


ctcttgggtt 


ctcccaattt 


gtcccataga 


12240 


gtcacacagg 


agcagctcag 


ctcaatcctg 


123 0 0 


gagggggcct 


tgggcccagt 


gacaggcaga 


12360 


ggcaagcatg 


aggggaggac 


ctagccccag 


12420 


atttttcacc 


tatagaagat 


gttgtgcctg 


12480 


ctgtctggcc 


tcttcgtcat 


ctccaaggat 


12540 


atctggagaa 


tgtaccccaa 


ctccacggtg 


12600 


cagcgaaggt 


gctgccttgg 


atttggccaa 


12660 


cctggagccc 


aggtgtgggc 


tgctgggccc 


12720 


tgtggtggaa 


aggtgggtgc 


tgggctgctg 


12780 


ttttatagat 


tgggaaatgg 


ggccagagac 


12840 


acttgacggc 


aaatccagga 


ccacagccta 


12900 


gcatggtgtg 


cctggtggcc 


cctgtgatcc 


12960 


ccaagtccac 


tttgttaccc 


tcatcttctg 


13020 


attagtgggt 


ctttgactcc 


aagcctggtg 


13080 


agcagaaggc 


agaccactgg 


gtgcctctgg 


13140 


ttttggaagg 


ggcgggggaa 


cggagtctca 


13200 


acaatcgagg 


ctcactgtaa 


cctccgcctc 


13260 


tctcgagtag 


ctaggattac 


aggcacccac 


13320 


gtatttttag 


tagagatggg 


gtttcaccat 


13380 


tctagtgatc 


ctcccgcctc 


agcctcccaa 


13440 


gcccacaagg 


gttctgacca 


tagtggtttg 


13500 


acacgtgcat 


acacatgcac 


atcacatgca 


13560 


tgcacacaca 


cacatataca 


tatgcttccc 


13620 
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catcaaatgc atgtgatact atgagatgac tcccagctgc acggctcttc ccctcctctg 13680 

ctgtaaaaca tgctccctca ctctgggact aggtaccccc atctgtcaaa taggcttgga 13740 

tcagatgacc ctaaggctac cttcattctt accagggacc catgagattc tccagccctc 13 800 

aaacgttagg ccaatactca ccaaagaccc atatgtgcca ggcattgagt gtagggcagt 13 860 

10 gaccaagaca gaccaagtcc ccgtcccaca gagacccacc ctgccttccc cacgcacacc 13920 

tgggagctga aaccccacct gccctctgct gtggatctta gcaaggaccc ctgcctggct 13980 

aagcccacgt gatgcgtgca gacacccagc gaggcctcac cctgccttcc ttccctctac 14040 

15 

cctctcaggt ccacatccag aggctggagc tggccttgct ctgcttcact gtcctagagc 14100 

tcttcctgcc agtgcccaca gctgtcacag cctggagagg ggactgccca tctgcaaagg 14160 

20 taagacaagg gcttgtcttc ccaggaagac aaaaaatggg ggcatgaggc agcagagagc 14220 

tcagaaagga aagatatgtg ggggttacaa gaggagcgtg tttcattcat tcctcacgtt 14280 

gctgagcgtt cactaagtga cagcccacga tccagtaggg gagccaaggc acgggggcaa 14340 

25 

gagaggaggc caatccaagg agaaaagggt gaggctgcca gagaggaaac tccatgggtg 14400 

cccagaacaa gaggctactg tgctgccagc tctgaaggcc ctggggaaat tgcatcacag 14460 

30 aggcccatga tgaccgggca ggattcggac ccacaggcat aggtggaaga gagactttca 14 52 0 

gcagagcaag aagagcacga gcactggctt ggagacagga cagagaaggc ctggggaagc 14580 

tatggtgtgt gtggagtttg gctgtagtgg aaacaagtga cagagtcatc aaagattggg 14 64 0 
35 ~ ~ ~ "* ^ " 

catttaagga attgcaggga ctctattaag catcaatcga tgcatccatt gatggatgga 14700 

tggatggatg atgtatggat ggatatagac gtgcagatat acagataaat atagctgtat 14760 

40 ttaactttat cttctcaata accctttttc ttttgagact gagtctcact ctgtcaccca 14820 

ggctggagta cagtggcacg atctcggccc actgcaacct ccgcctcccg ggttcaagca 14880 

attctcctac ctcagcctcc caagtagctg ggactacagg catgtgccac catacctggc 14940 

45 

taatttttgt atttttttaa gtatagatga ggttttgcca tcctggccag gctggtctca 15000 

aactcctgac ctcaagtgat ccaccaccca tggcctccca aagtgctggg attacaggca 15060 

50 tcagccatca tgcccagccc tcaacaaccc tttgaggtac ttacattctc atactacaga 15120 

ctagcaagct gaggtccaga gaggtcacat cacttctcaa ggcgaagcag ccaattagga 15180 

aatccaggac tcagcctgag ctatgtccaa cccacagtct tgcccttgtc tacccatggg 1524 0 

55 

gatggcatag ggctagatat attttaaaat agaaggcaag tctctgctca ccccaaagat 15300 

cctgaacagt tcaggataag gatgttttat aaaagaaaaa aaataaataa aaaataaaga 15360 

60 ccctgaccag aagtgatctc taaggctcat ttctttaaag aaaaaaaaat catattacaa 15420 
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tctcaacttt agaacattat ttaaaaaagt 
cactgatttt catgttagtg acctcatcac 
5 gttgatggcc tctgagtttc atttaacatc 

gctgcctgac aggaagtctt ctcattaaga 
actgaggcca accccacttt taaggatggg 

10 

tatgagcaat aattgttcct atttattgac 
ctgcaggtgg ctgtattcat tttctattga 
15 cttaaaatga cacctgtatt accctacagc 

aagggctcaa atcaaggtgt gtcgccagga 
tctctttcct tgccttttcc agcttctaga 

20 

ttcttccacc ttcaaagcca gcatcacaac 
cctttaagga cccttatcat catactgggc 
25 tctcaaagct gattggaaac cttaattcca 
accagagatt agaacatggg tgtcttctgg 
agatgtcctt gaatctcacg gccattctgg 

30 

ggaaaaaatg aaccttagaa gagttacgtg 
ccatgccagg ttccccaatg ggccctgtgc 
35 caggctctcc agcactgagt tggtgccacc 
agcaatcatt ggcagacact gttggcatta 
agggacttgg gtcatggccc agcccccctc 

40 

ctggtcacca ggacactggt agcgggacgg 
tgggcctccc cctactccct tttcaagaac 
45 tatggaagtc gcttcgtctc tcaagcctca 
ttcagaagtg aactagcaga gaagtgtttc 
cagcaggggc ggcgactttt ctttcgcaga 

50 

cattgcatct caaaggcctg ccggtggagc 
gcgacgcaca acacaagcaa catcagaggt 
55 tcagaaccca cgcttggccc cttctccctg 
tgtaaggggc aaggggctaa cagtgctgca 
caaggcagca atggtcctat cccctcaggc 

60 

catctcagca cctgacttgg ttgaccagaa 
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gccaaatccc 


ccataattct 


tcaagtttga 


15480 


acaccaagtg 


ttcacataat 


gcatatataa 


15540 


aagcagagaa 


ggtgtctttg 


ttaattccta 


15600 


catcagaaag 


ccagatgggg 


gtcttgagtc 


15660 


gaaaactggg 


gccccatacc 


tagctgcagc 


15720 


ctttgctgtg 


ttccaggcac 


tgtgctagtg 


15780 


tgttgtagta 


aattatcaca 


aatttggtgg 


15840 


tctggaggtc 


agcggtttga 


aatgagtctc 


15900 


ctgcatccct 


tccgggggca 


tcaaaaagaa 


15960 


ggctgcctgc 


attccttggc 


ttgtggctcc 


16020 


atcctccaat 


aaccctgtct 


ccctcttcta 


16080 


ccagagagat 


aatccaggag 


aatctcccaa 


16140 


tctgcaacct 


taattcctct 


ctgccgtgtg 


16200 


gggccgtgct 


tctccctact 


gcactagcta 


16260 


gacgcaggtt 


ctgcttttcc 


attttaaaga 


16320 


acttcccatg 


gtctcacagc 


cagtaagtgg 


16380 


cctccccatc 


ggtaaggatg 


gtgccaggga 


16440 


ctcatcacta 


atcagggatg 


gaggtgtggc 


16500 


gacctgcccc 


tagagtgtat 


cctggctaga 


16560 


aggaaaccca 


acagggtgag 


tgacttgcac 


16620 


gaaccagagc 


cccatccagt 


gactcccatg 


16680 


agggagggtg 


gagggagtca 


cacactggtc 


16740 


gtttccccac 


attcatggag 


aacttgagtc 


16800 


ggaagcagcc 


agtcgctcca 


cagctgtgag 


16860 


atgatgatgc 


atgccttgtt 


ccgaatacac 


16920 


ccccgccatc 


ctaccagagt 


gtgattcaag 


16980 


gaagaggttt 


ggccctggct 


ccccagacct 


17040 


gcatagatag 


aaaccagttc 


ccaggggagc 


17100 


gctagaactg 


cctggcctgg 


gagtggtagc 


17160 


actagaggaa 


ctcccagggc 


cccccaggaa 


17220 


ctacctcagc 


tgaacctggg 


cgcaaccagg 


17280 
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acccatcgcc ttccctgcca gcctccagag 
cccccatgcc ctctggcctc ccttgcctgc 

5 

gatataaggg cagccaggaa ggaaggtctc 
ctcaacc.tac taagacatct ctgagggagc 
10 agggcagctg gggagttgag tcccatagaa 

ccctctgcat gaatcttttt gccctcctgg 
tgcaaggatg acatgatcca ggagtctccc 

15 

aaggttcaca gttgtgacat cgaggttgcc 
tacttgtgct aagtaaataa tgtgagggag 
20 cagtcatctg aggaatagtc ttcaggactt 
gttttttttc cacactgatt cttttttcta 
aagatatagg aaatcagtat ctttccataa 

25 

gtgaaataaa acgcaatttt cccctccaac 
cttgctagct ctgccccagc caccatggct 
30 acccctgcct ctgagccatc acccttgccc 

atttgcagag ctcactcttt gaatctttac 
tatttaatct caatgtcatt taatcctccc 

35 

taatatctcc atttcatgca ggctcagaga 
agtcactgac ggagctgcga tctggaccca 
40 atgcccaaac ttggttggag catagcccct 

gatgagattt gcacatacaa aaggctagag 
cagctcctgg gaacgctgtc ctctcagata 

45 

atctgtggtg gcccagattg ttttgttttg 
ttgagatgga gtctcgctct gttgcccagc 
50 tgcaatctcc gcctccctgg ttcaagtgat 

actataggca cacgccacca cgcccagcta 
caccatattg gtcagcctgg tctcaaactc 

55 

tcccaaagtg ctgggattac aggcgtgagc 
ccgagatagc agtctgctct ctactgcctt 
60 gtatcatttg cccttgcaca gaatatccct 
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caaacccctg 


ggcctgcagc 


cccctggtgc 


17340 


cttctctgat 


caatggtgga 


tgcacagggg 


17400 


cagagcggca 


caggcagggg 


cttcccttgc 


17460 


ctgtgtgtct 


gaatccctgg 


atttgggctg 


17520 


actgtgtaca 


gttcttccct 


ctgaactaga 


17580 


acctcagtgt 


ctccatctgt 


aaagtggagg 


17640 


agcttgaaag 


ctctgagatc 


aggagaattc 


17700 


agtaaaactt 


cagcctcgga 


atttcaaacc 


17760 


agggacctgc 


cttcctcatt 


ttcctaattt 


17820 


ttgcttattt 


gtgagctgca 


atgtattctg 


17880 


tattatttta 


aaggggcact 


ttgtagcagc 


17940 


acagaagcaa 


accccaaaaa 


taacaaacat 


18000 


cccgtggtct 


cacccactgc 


ttcccttcct 


18060 


tccttgaggt 


tccacagaca 


ccccagggac 


18120 


ttcctccacc 


tggattctct 


tcccccaggt 


18180 


tgggccactg 


actctgagcc 


aagtaaatgc 


18240 


atcgatctgt 


gacctagctg 


ctgttgctat 


18300 


agttaagcaa 


gttgccccgg 


acacatggat 


18360 


gactgcaaac 


tgaagagcca 


ctgcctgaca 


18420 


gctctcccaa 


agttgcactt 


tcactgggaa 


18480 


cgatggtcta 


tacagcaaag 


tcagccctca 


18540 


agccatttct 


tacatagttg 


atggctcgat 


18600 


tttcgctttg 


ttttgttttc 


ttttgttttg 


18660 


ccagagtaca 


gtggcactat 


ctcagctcac 


18720 


tctcctatct 


cagcctccca 


agtagctggg 


18780 


atttttatat 


ttttagtaga 


gacaggattg 


18840 


ctgacctcag 


gtgatccacc 


tgcctcagcc 


18900 


cactgtgccc 


agcccaggtt 


ttgaagttgt 


18960 


ataaaatccc 


tgtgtgaagg 


gatgctctca 


19020 


ggggtttgag 


gttctttgaa 


ttctccctct 


19080 
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ttgtcatctc tttcgctgcc acttctggct gtggtcacta gcttggccat agcacctctc 19140 

ttctccactt ctgatctgct gcttctaacc ttctatagat tgcagctggc tttaaaatag 19200 

5 attgtaaagt gtaaggcatt aggttctgag acagcggcag agagagccat gcaaatgttt 19260 

aggacaaccc agtctttctt tttttttttt tttttttttt ttt 19303 
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